To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 額??倚?????阿???霓??泣hぜ 1000101001111010001111110011111110011000110111110011111100111111001111110011111100111111100010001010001000111111001111110011111111101000101111010011111100111111100010111000001110000010100010001000001010111010 8a7a3f3f98df3f3f3f3f3f88a23f3f3fe8bd3f3f8b83828882ba
EUC-JP 額??倚??洹??阿???霓??泣hぜ 10110011110110110011111100111111110100001110000100111111001111111000111111000111101110100011111100111111101100001010010000111111001111110011111111110000101111110011111100111111101101011110001110100011111010001010010010111100 b3db3f3fd0e13f3f8fc7ba3f3fb0a43f3f3ff0bf3f3fb5e3a3e8a4bc
UTF-8 額곕쵎倚묊댆洹잆걶阿쇥뗫떈霓띰퐢泣hぜ 111010011010000110001101111010101011001110010101111011001011010110001110111001011000000010011010111010111010110010001010111010111000110010000110111001101011010010111001111011001001111010000110111010101011000110110110111010011001100010111111111011001000011110100101111010111001011110101011111010111001011010001000111010011001110010010011111010111001110110110000111011011001000010100010111001101011001110100011111011111011110110001000111000111000000110011100 e9a18deab395ecb58ee5809aebac8aeb8c86e6b4b9ec9e86eab1b6e998bfec87a5eb97abeb9688e99c93eb9db0ed90a2e6b3a3efbd88e3819c
UHC 額곕쵎倚묊댆洹잆걶阿쇥뗫떈霓띰퐢泣hぜ 1110010011111110101100001110101110101100100100001110101111101111100100011110011110001000101100001110101010110111100111111110001110000001100111001110010010111001100110011101000110001011111010111000101110011110111001111110011110110110111011111011110110001011111010111110100010100011111010001010101010111100 e4feb0ebac90ebef91e788b0eab79fe3819ce4b999d18beb8b9ee7e7b6efbd8bebe8a3e8aabc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)