What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 厓?厓??娃??}v厓?厓??娃??}vB 111110101000110100111111111110101000110100111111001111111000100010100001001111110011111101111101011101101111101010001101001111111111101010001101001111110011111110001000101000010011111100111111011111010111011001000010 fa8d3ffa8d3f3f88a13f3f7d76fa8d3ffa8d3f3f88a13f3f7d7642
EUC-JP 厓?厓??娃??}v厓?厓??娃??}vB 10001111101101001100011100111111100011111011010011000111001111110011111110110000101000110011111100111111011111010111011010001111101101001100011100111111100011111011010011000111001111110011111110110000101000110011111100111111011111010111011001000010 8fb4c73f8fb4c73f3fb0a33f3f7d768fb4c73f8fb4c73f3fb0a33f3f7d7642
UTF-8 厓쯣厓⒴콪娃붷춼}v厓쯣厓⒴콪娃붷춼}vB 1110010110001110100100111110110010101111101000111110010110001110100100111110001010010010101101001110110010111101101010101110010110101000100000111110101110110110101101111110110010110110101111000111110101110110111001011000111010010011111011001010111110100011111001011000111010010011111000101001001010110100111011001011110110101010111001011010100010000011111010111011011010110111111011001011011010111100011111010111011001000010 e58e93ecafa3e58e93e292b4ecbdaae5a883ebb6b7ecb6bc7d76e58e93ecafa3e58e93e292b4ecbdaae5a883ebb6b7ecb6bc7d7642
UHC 厓쯣厓⒴콪娃붷춼}v厓쯣厓⒴콪娃붷춼}vB 11100100111011011010100101100100111001001110110110101001111001011011000110011110111010001101111110010100111001011010110110011000011111010111011011100100111011011010100101100100111001001110110110101001111001011011000110011110111010001101111110010100111001011010110110011000011111010111011001000010 e4eda964e4eda9e5b19ee8df94e5ad987d76e4eda964e4eda9e5b19ee8df94e5ad987d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
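
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the function name `encode_report` is hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the tool does as well; results for some characters may still differ from the table because codec implementations vary.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hex_string in encode_report("あ"):
        print(label, binary, hex_string)
```

Calling `encode_report` with a single character gives one row per charset, in the same column order as the table (minus the re-decoded character column).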