To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 永??域??孃∽?v永??域??孃∽?vB 1000100101101001001111110011111110001000111001100011111100111111100110110110111110000001111001000011111101110110100010010110100100111111001111111000100011100110001111110011111110011011011011111000000111100100001111110111011001000010 89693f3f88e63f3f9b6f81e43f7689693f3f88e63f3f9b6f81e43f7642
EUC-JP 永??域??孃∽?v永??域??孃∽?vB 1011000111001010001111110011111110110000111010000011111100111111110101011101000010100010111001100011111101110110101100011100101000111111001111111011000011101000001111110011111111010101110100001010001011100110001111110111011001000010 b1ca3f3fb0e83f3fd5d0a2e63f76b1ca3f3fb0e83f3fd5d0a2e63f7642
UTF-8 永쏃넀域꿰탿孃∽슉v永쏃넀域꿰탿孃∽슉vB 111001101011000010111000111011001000111110000011111010111000010010000000111001011001111110011111111010101011111110110000111011011000001110111111111001011010110110000011111000101000100010111101111011001000101010001001011101101110011010110000101110001110110010001111100000111110101110000100100000001110010110011111100111111110101010111111101100001110110110000011101111111110010110101101100000111110001010001000101111011110110010001010100010010111011001000010 e6b0b8ec8f83eb8480e59f9feabfb0ed83bfe5ad83e288bdec8a8976e6b0b8ec8f83eb8480e59f9feabfb0ed83bfe5ad83e288bdec8a897642
UHC 永쏃넀域꿰탿孃∽슉v永쏃넀域꿰탿孃∽슉vB 111001111011010110011011111010011000011010010000111001101011010010110010111001111011010110011011111001011011111010100001111011111011110110110101011101101110011110110101100110111110100110000110100100001110011010110100101100101110011110110101100110111110010110111110101000011110111110111101101101010111011001000010 e7b59be98690e6b4b2e7b59be5bea1efbdb576e7b59be98690e6b4b2e7b59be5bea1efbdb57642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)