To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 猷??娃?6二??N}猷??娃?6二??N{^ 10010111010100010011111100111111100010001010000100111111100000100101010110010011111100010011111100111111010011100111110110010111010100010011111100111111100010001010000100111111100000100101010110010011111100010011111100111111010011100111101101011110 97513f3f88a13f825593f13f3f4e7d97513f3f88a13f825593f13f3f4e7b5e
EUC-JP 猷??娃?6二??N}猷??娃?6二??N{^ 11001101101100100011111100111111101100001010001100111111101000111011011011000110111100110011111100111111010011100111110111001101101100100011111100111111101100001010001100111111101000111011011011000110111100110011111100111111010011100111101101011110 cdb23f3fb0a33fa3b6c6f33f3f4e7dcdb23f3fb0a33fa3b6c6f33f3f4e7b5e
UTF-8 猷듭뿉娃좊6二늚뎣N}猷듭뿉娃좊6二늚뎣N{^ 1110011110001100101101111110101110010011101011011110101110111111100010011110010110101000100000111110110010100010100010101110111110111100100101101110010010111010100011001110101110001010100110101110101110001110101000110100111001111101111001111000110010110111111010111001001110101101111010111011111110001001111001011010100010000011111011001010001010001010111011111011110010010110111001001011101010001100111010111000101010011010111010111000111010100011010011100111101101011110 e78cb7eb93adebbf89e5a883eca28aefbc96e4ba8ceb8a9aeb8ea34e7de78cb7eb93adebbf89e5a883eca28aefbc96e4ba8ceb8a9aeb8ea34e7b5e
UHC 猷듭뿉娃좊6二늚뎣N}猷듭뿉娃좊6二늚뎣N{^ 1110101110100011101101011110110010010111100100001110100011011111101000001110101110100011101101101110110010100011101101001100010110001001011100100100111001111101111010111010001110110101111011001001011110010000111010001101111110100000111010111010001110110110111011001010001110110100110001011000100101110010010011100111101101011110 eba3b5ec9790e8dfa0eba3b6eca3b4c589724e7deba3b5ec9790e8dfa0eba3b6eca3b4c589724e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)