To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ú´¥ú¶§ú´´tú´¥ú¶§ú´´tB 111110101011010010100101111110101011011010100111111110101011010010110100011101001111101010110100101001011111101010110110101001111111101010110100101101000111010001000010 fab4a5fab6a7fab4b474fab4a5fab6a7fab4b47442
SJIS-WIN ?´¥?¶§?´´t?´¥?¶§?´´tB 001111111000000101001100100000011000111100111111100000011111011110000001100110000011111110000001010011001000000101001100011101000011111110000001010011001000000110001111001111111000000111110111100000011001100000111111100000010100110010000001010011000111010001000010 3f814c818f3f81f781983f814c814c743f814c818f3f81f781983f814c814c7442
EUC-JP ú´?ú¶§ú´´tú´?ú¶§ú´´tB 10001111101010111110001010100001101011010011111110001111101010111110001010100010111110011010000111111000100011111010101111100010101000011010110110100001101011010111010010001111101010111110001010100001101011010011111110001111101010111110001010100010111110011010000111111000100011111010101111100010101000011010110110100001101011010111010001000010 8fabe2a1ad3f8fabe2a2f9a1f88fabe2a1ada1ad748fabe2a1ad3f8fabe2a2f9a1f88fabe2a1ada1ad7442
UTF-8 ú´¥ú¶§ú´´tú´¥ú¶§ú´´tB 110000111011101011000010101101001100001010100101110000111011101011000010101101101100001010100111110000111011101011000010101101001100001010110100011101001100001110111010110000101011010011000010101001011100001110111010110000101011011011000010101001111100001110111010110000101011010011000010101101000111010001000010 c3bac2b4c2a5c3bac2b6c2a7c3bac2b4c2b474c3bac2b4c2a5c3bac2b6c2a7c3bac2b4c2b47442
UHC ?´??¶§?´´t?´??¶§?´´tB 00111111101000101010010100111111001111111010001011010010101000011101011100111111101000101010010110100010101001010111010000111111101000101010010100111111001111111010001011010010101000011101011100111111101000101010010110100010101001010111010001000010 3fa2a53f3fa2d2a1d73fa2a5a2a5743fa2a53f3fa2d2a1d73fa2a5a2a57442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)