To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????h???? 001111110011111100111111001111110110100000111111001111110011111100111111 3f3f3f3f683f3f3f3f
SJIS-WIN 他谷遜短h他谷遜短 1001000110111100100100100100101010010001101110111001001001011010011010001001000110111100100100100100101010010001101110111001001001011010 91bc924a91bb925a6891bc924a91bb925a
EUC-JP 他谷遜短h他谷遜短 1100001010111110110000111010101111000010101111011100001110111011011010001100001010111110110000111010101111000010101111011100001110111011 c2bec3abc2bdc3bb68c2bec3abc2bdc3bb
UTF-8 他谷遜短h他谷遜短 11100100101110111001011011101000101100001011011111101001100000011001110011100111100111111010110101101000111001001011101110010110111010001011000010110111111010011000000110011100111001111001111110101101 e4bb96e8b0b7e9819ce79fad68e4bb96e8b0b7e9819ce79fad
UHC 他谷遜短h他谷遜短 1111011011100010110011011101101111100001111000011101001110101101011010001111011011100010110011011101101111100001111000011101001110101101 f6e2cddbe1e1d3ad68f6e2cddbe1e1d3ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)