What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????????B | 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f3f3f3f3f42 |
| SJIS-WIN | 竪他贈誰遜測竪他贈誰遜測B | 10010010010001111001000110111100100100011010000110010010010011101001000110111011100100011010101010010010010001111001000110111100100100011010000110010010010011101001000110111011100100011010101001000010 | 924791bc91a1924e91bb91aa924791bc91a1924e91bb91aa42 |
| EUC-JP | 竪他贈誰遜測竪他贈誰遜測B | 11000011101010001100001010111110110000101010001111000011101011111100001010111101110000101010110011000011101010001100001010111110110000101010001111000011101011111100001010111101110000101010110001000010 | c3a8c2bec2a3c3afc2bdc2acc3a8c2bec2a3c3afc2bdc2ac42 |
| UTF-8 | 竪他贈誰遜測竪他贈誰遜測B | 11100111101010111010101011100100101110111001011011101000101101001000100011101000101010101011000011101001100000011001110011100110101110001010110011100111101010111010101011100100101110111001011011101000101101001000100011101000101010101011000011101001100000011001110011100110101110001010110001000010 | e7abaae4bb96e8b488e8aab0e9819ce6b8ace7abaae4bb96e8b488e8aab0e9819ce6b8ac42 |
| UHC | 竪他贈誰遜測竪他贈誰遜測B | 11100010101101011111011011100010111100011111110011100010110000011110000111100001111101101011010011100010101101011111011011100010111100011111110011100010110000011110000111100001111101101011010001000010 | e2b5f6e2f1fce2c1e1e1f6b4e2b5f6e2f1fce2c1e1e1f6b442 |
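
A conversion like the one tabulated above can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp`, `cp949` for UHC) are Python's own and their correspondence to the labels on this page is an assumption; `encode_row` and `encode_table` are hypothetical helper names. Characters a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("竪B").items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times the length of the hexadecimal column.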

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).