What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 猷?????猷??v猷?????猷??vB 10010111010100010011111100111111001111110011111100111111100101110101000100111111001111110111011010010111010100010011111100111111001111110011111100111111100101110101000100111111001111110111011001000010 97513f3f3f3f3f97513f3f7697513f3f3f3f3f97513f3f7642
EUC-JP 猷?????猷??v猷?????猷??vB 11001101101100100011111100111111001111110011111100111111110011011011001000111111001111110111011011001101101100100011111100111111001111110011111100111111110011011011001000111111001111110111011001000010 cdb23f3f3f3f3fcdb23f3f76cdb23f3f3f3f3fcdb23f3f7642
UTF-8 猷뜰걖烈욏뫖猷띈킊v猷뜰걖烈욏뫖猷띈킊vB 111001111000110010110111111010111001110010110000111010101011000110010110111011111010011010011111111011001001101010001111111010111010101110010110111001111000110010110111111010111001110110001000111011011000001010001010011101101110011110001100101101111110101110011100101100001110101010110001100101101110111110100110100111111110110010011010100011111110101110101011100101101110011110001100101101111110101110011101100010001110110110000010100010100111011001000010 e78cb7eb9cb0eab196efa69fec9a8febab96e78cb7eb9d88ed828a76e78cb7eb9cb0eab196efa69fec9a8febab96e78cb7eb9d88ed828a7642
UHC 猷뜰걖烈욏뫖猷띈킊v猷뜰걖烈욏뫖猷띈킊vB 111010111010001110110110111000111000000110000001111001101110111110011110111011011001000110111000111010111010001110110110111010001011010010010110011101101110101110100011101101101110001110000001100000011110011011101111100111101110110110010001101110001110101110100011101101101110100010110100100101100111011001000010 eba3b6e38181e6ef9eed91b8eba3b6e8b49676eba3b6e38181e6ef9eed91b8eba3b6e8b4967642

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
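
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name encode_table and the codec names (cp932 standing in for SJIS-Win, cp949 for UHC) are assumptions chosen for illustration. It assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the rows above.

```python
# Hypothetical helper: encode one string in several charsets and show
# the resulting bytes as a binary string and a hexadecimal string.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, binary_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("vB"):
        print(name, bits, hexstr)
```

For pure ASCII input such as "vB", every charset listed here should yield the same two bytes (76 42), which matches the tail of each row in the table. Differences appear only for characters outside ASCII.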