To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 也??嗚?????B 100101101110011100111111001111111001101001101010001111110011111100111111001111110011111101000010 96e73f3f9a6a3f3f3f3f3f42
EUC-JP 也??嗚?????B 110011001110100100111111001111111101001111001011001111110011111100111111001111110011111101000010 cce93f3fd3cb3f3f3f3f3f42
UTF-8 也좊젌嗚멧펯溜볡찂B 11100100101110011001111111101100101000101000101011101100101000001000110011100101100101111001101011101011101010011010011111101101100011101010111111101111101001111000101111101011101100111010000111101100101100001000001001000010 e4b99feca28aeca08ce5979aeba9a7ed8eafefa78bebb3a1ecb08242
UHC 也좊젌嗚멧펯溜볡찂B 11100101101001011010000011101011101000001000110111100111111100001011100011100100101111001000000111101010111111101001001111100111101010011000011001000010 e5a5a0eba08de7f0b8e4bc81eafe93e7a98642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)