To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??循??猷ラ?殉?6肄??猷??循 10010111010100010011111100111111100011110111101000111111001111111001011101010001100000111000100100111111100011110111110100111111100000100101010111100011111001010011111100111111100101110101000100111111001111111000111101111010 97513f3f8f7a3f3f975183893f8f7d3f8255e3e53f3f97513f3f8f7a
EUC-JP 猷??循??猷ラ?殉?6肄??猷??循 11001101101100100011111100111111101111011101101100111111001111111100110110110010101001011110100100111111101111011101111000111111101000111011011011100110111001110011111100111111110011011011001000111111001111111011110111011011 cdb23f3fbddb3f3fcdb2a5e93fbdde3fa3b6e6e73f3fcdb23f3fbddb
UTF-8 猷드띂循꾟뼞猷ラ레殉믩6肄볝궊猷댁빴循 111001111000110010110111111010111001001110011100111010111001110110000010111001011011111010101010111010101011111010011111111010111011110010011110111001111000110010110111111000111000001110101001111010111010000010001000111001101010111010001001111010111010111110101001111011111011110010010110111010001000001010000100111010111011001110011101111010101011011010001010111001111000110010110111111010111000110010000001111010111011100110110100111001011011111010101010 e78cb7eb939ceb9d82e5beaaeabe9febbc9ee78cb7e383a9eba088e6ae89ebafa9efbc96e88284ebb39deab68ae78cb7eb8c81ebb9b4e5beaa
UHC 猷드띂循꾟뼞猷ラ레殉믩6肄볝궊猷댁빴循 1110101110100011101101011110010110001101101111011110001011100000100001001110001010010110101000011110101110100011101010111110100110110111101110011110001011100110100100101110101110100011101101101110110010111101100100111110001110000010101000011110101110100011101101001110110010111011101001101110001011100000 eba3b5e58dbde2e084e296a1eba3abe9b7b9e2e692eba3b6ecbd93e382a1eba3b4ecbba6e2e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)