To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???v???vB 001111110011111100111111011101100011111100111111001111110111011001000010 3f3f3f763f3f3f7642
SJIS-WIN 厓ф?v厓ф?vB 11111010100011011000010010000110001111110111011011111010100011011000010010000110001111110111011001000010 fa8d84863f76fa8d84863f7642
EUC-JP 厓ф?v厓ф?vB 100011111011010011000111101001111110011000111111011101101000111110110100110001111010011111100110001111110111011001000010 8fb4c7a7e63f768fb4c7a7e63f7642
UTF-8 厓ф굄v厓ф굄vB 11100101100011101001001111010001100001001110101010110101100001000111011011100101100011101001001111010001100001001110101010110101100001000111011001000010 e58e93d184eab58476e58e93d184eab5847642
UHC 厓ф굄v厓ф굄vB 111001001110110110101100111001101011000110101111011101101110010011101101101011001110011010110001101011110111011001000010 e4edace6b1af76e4edace6b1af7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)