To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鳶????ぜ揄?? 100100111100111000111111001111110011111100111111100000101011101010011101100010010011111100111111 93ce3f3f3f3f82ba9d893f3f
EUC-JP 鳶??靷?ぜ揄?? 1100011011010000001111110011111110001111111001111011110100111111101001001011110011011001111010010011111100111111 c6d03f3f8fe7bd3fa4bcd9e93f3f
UTF-8 鳶롫끏靷뽬ぜ揄쒕꽱 111010011011001110110110111010111010000110101011111010111000000110001111111010011001110110110111111010111011110110101100111000111000000110011100111001101000111110000100111011001001001010010101111010101011110110110001 e9b3b6eba1abeb818fe99db7ebbdace3819ce68f84ec9295eabdb1
UHC 鳶롫끏靷뽬ぜ揄쒕꽱 111001101110100110001110111010111000010110111111111011001110011010010110111010001010101010111100111010101111000110011100111010111000010010111100 e6e98eeb85bfece696e8aabceaf19ceb84bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)