What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???v???vB | 001111110011111100111111011101100011111100111111001111110111011001000010 | 3f3f3f763f3f3f7642 |
| SJIS-WIN | 俄??v俄??vB | 1000100111100010001111110011111101110110100010011110001000111111001111110111011001000010 | 89e23f3f7689e23f3f7642 |
| EUC-JP | 俄??v俄??vB | 1011001011100100001111110011111101110110101100101110010000111111001111110111011001000010 | b2e43f3f76b2e43f3f7642 |
| UTF-8 | 俄겼ㄷv俄겼ㄷvB | 111001001011111110000100111010101011001010111100111000111000010010110111011101101110010010111111100001001110101010110010101111001110001110000100101101110111011001000010 | e4bf84eab2bce384b776e4bf84eab2bce384b77642 |
| UHC | 俄겼ㄷv俄겼ㄷvB | 111001001010110110110000111001011010010010100111011101101110010010101101101100001110010110100100101001110111011001000010 | e4adb0e5a4a776e4adb0e5a4a77642 |
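The conversion shown in the table can be sketched in a few lines of Python. This is only an illustration, not the tool's actual implementation: the function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the `3f` bytes visible in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Each byte is written as exactly 8 bits, keeping leading zeros.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "俄v"
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte contributes eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.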

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).