To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 嚥≪?以??誘? 100110101000101110000001111000010011111110001000110010000011111100111111100101110101010100111111 9a8b81e13f88c83f3f97553f
EUC-JP 嚥≪?以??誘? 110100111110101110100010111000110011111110110000110010100011111100111111110011011011011000111111 d3eba2e33fb0ca3f3fcdb63f
UTF-8 嚥≪늾以곫에誘┙ 111001011001101010100101111000101000100110101010111010111000101010111110111001001011101110100101111010101011001110101011111011001001011110010000111010001010101010011000111000101001010010011001 e59aa5e289aaeb8abee4bba5eab3abec9790e8aa98e29499
UHC 嚥≪늾以곫에誘┙ 11100110101111111010000111101100100010001000011111101100101001001000000111100110101111111010000111101011101011111010011011000100 e6bfa1ec8887eca481e6bfa1ebafa6c4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)