What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 砥??繃v砥??繃vB 100100110111010100111111001111111110001101111101011101101001001101110101001111110011111111100011011111010111011001000010 93753f3fe37d7693753f3fe37d7642
EUC-JP 砥??繃v砥??繃vB 110001011101011000111111001111111110010111011110011101101100010111010110001111110011111111100101110111100111011001000010 c5d63f3fe5de76c5d63f3fe5de7642
UTF-8 砥쾨렩繃v砥쾨렩繃vB 111001111010000010100101111011001011111010101000111010111010000010101001111001111011100110000011011101101110011110100000101001011110110010111110101010001110101110100000101010011110011110111001100000110111011001000010 e7a0a5ecbea8eba0a9e7b98376e7a0a5ecbea8eba0a9e7b9837642
UHC 砥쾨렩繃v砥쾨렩繃vB 11110010101100101100010011101010100011101011011111011101110111100111011011110010101100101100010011101010100011101011011111011101110111100111011001000010 f2b2c4ea8eb7ddde76f2b2c4ea8eb7ddde7642

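SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is replaced with "?" (0x3f), which is why runs of 3f appear in the rows above.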
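
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page; it is a minimal illustration that assumes Python's built-in codecs correspond to the charsets listed (in particular, that `cp932` stands in for SJIS-WIN and `cp949` for UHC). The `CODECS` mapping and the `to_bit_strings` helper are hypothetical names introduced only for this example.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as implemented by Python
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's name for Unified Hangul Code
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


# Print one row per charset for a short sample string.
for label, codec in CODECS.items():
    binary, hexadecimal = to_bit_strings("vB", codec)
    print(label, binary, hexadecimal)
```

Because "v" and "B" are ASCII, every charset above encodes them to the same bytes (76 and 42 in hexadecimal); differences only appear for non-ASCII input.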