What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 猷?┰徇?5畑??Lh猷?┰徇?5畑??L 10010111010100010011111110000100101110111001110001101101001111111000001001010100100101001010100000111111001111110100110001101000100101110101000100111111100001001011101110011100011011010011111110000010010101001001010010101000001111110011111101001100 97513f84bb9c6d3f825494a83f3f4c6897513f84bb9c6d3f825494a83f3f4c
EUC-JP 猷?┰徇?5畑??Lh猷?┰徇?5畑??L 11001101101100100011111110101000101111011101011111001110001111111010001110110101110010001010101000111111001111110100110001101000110011011011001000111111101010001011110111010111110011100011111110100011101101011100100010101010001111110011111101001100 cdb23fa8bdd7ce3fa3b5c8aa3f3f4c68cdb23fa8bdd7ce3fa3b5c8aa3f3f4c
UTF-8 猷띠┰徇붾5畑겸뿉Lh猷띠┰徇붾5畑겸뿉L 111001111000110010110111111010111001110110100000111000101001010010110000111001011011111010000111111010111011011010111110111011111011110010010101111001111001010110010001111010101011001010111000111010111011111110001001010011000110100011100111100011001011011111101011100111011010000011100010100101001011000011100101101111101000011111101011101101101011111011101111101111001001010111100111100101011001000111101010101100101011100011101011101111111000100101001100 e78cb7eb9da0e294b0e5be87ebb6beefbc95e79591eab2b8ebbf894c68e78cb7eb9da0e294b0e5be87ebb6beefbc95e79591eab2b8ebbf894c
UHC 猷띠┰徇붾5畑겸뿉Lh猷띠┰徇붾5畑겸뿉L 111010111010001110110110111011001010011010111101111000101101111110010100111010111010001110110101111011111010010110110000111000101001011110010000010011000110100011101011101000111011011011101100101001101011110111100010110111111001010011101011101000111011010111101111101001011011000011100010100101111001000001001100 eba3b6eca6bde2df94eba3b5efa5b0e297904c68eba3b6eca6bde2df94eba3b5efa5b0e297904c

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
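
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The CODECS mapping from the table's charset labels to Python codec names and the helper encode_to_bits are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    # (an assumption about how the tool handles them).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_to_bits("Lh", codec)
        print(label, binary, hexadecimal)
```

ASCII characters such as "Lh" produce the same bytes (4c 68) in all five charsets, which is why that sequence appears unchanged in every row of the table.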