To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 啼??私燾蒸?中?啼??私燾蒸?中?B 1001101001100101001111110011111110001110100001001111101101011010100011111111011000111111100100101000011000111111100110100110010100111111001111111000111010000100111110110101101010001111111101100011111110010010100001100011111101000010 9a653f3f8e84fb5a8ff63f92863f9a653f3f8e84fb5a8ff63f92863f42
EUC-JP 啼??私燾蒸?中?啼??私燾蒸?中?B 11010011110001100011111100111111101110111110010010001111110010101011110110111110111110000011111111000011111001100011111111010011110001100011111100111111101110111110010010001111110010101011110110111110111110000011111111000011111001100011111101000010 d3c63f3fbbe48fcabdbef83fc3e63fd3c63f3fbbe48fcabdbef83fc3e63f42
UTF-8 啼닸렫私燾蒸렡中렠啼닸렫私燾蒸렡中렠B 11100101100101011011110011101011100010111011100011101011101000001010101111100111101001111000000111100111100001111011111011101000100100101011100011101011101000001010000111100100101110001010110111101011101000001010000011100101100101011011110011101011100010111011100011101011101000001010101111100111101001111000000111100111100001111011111011101000100100101011100011101011101000001010000111100100101110001010110111101011101000001010000001000010 e595bceb8bb8eba0abe7a781e787bee892b8eba0a1e4b8adeba0a0e595bceb8bb8eba0abe7a781e787bee892b8eba0a1e4b8adeba0a042
UHC 啼닸렫私燾蒸렡中렠啼닸렫私燾蒸렡中렠B 11110000101001101011010011100110100011101011100111011110111001111101010010100111111100011111101010001110101100101111000111101001100011101011000111110000101001101011010011100110100011101011100111011110111001111101010010100111111100011111101010001110101100101111000111101001100011101011000101000010 f0a6b4e68eb9dee7d4a7f1fa8eb2f1e98eb1f0a6b4e68eb9dee7d4a7f1fa8eb2f1e98eb142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)