To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 絶??碍??域ε?v絶??碍??域ε?vB 1001000011100010001111110011111110001010010101100011111100111111100010001110011010000011110000110011111101110110100100001110001000111111001111111000101001010110001111110011111110001000111001101000001111000011001111110111011001000010 90e23f3f8a563f3f88e683c33f7690e23f3f8a563f3f88e683c33f7642
EUC-JP 絶??碍??域ε?v絶??碍??域ε?vB 1100000011100100001111110011111110110011101101110011111100111111101100001110100010100110110001010011111101110110110000001110010000111111001111111011001110110111001111110011111110110000111010001010011011000101001111110111011001000010 c0e43f3fb3b73f3fb0e8a6c53f76c0e43f3fb3b73f3fb0e8a6c53f7642
UTF-8 絶귝슅碍딉푴域ε쎋v絶귝슅碍딉푴域ε쎋vB 11100111101101011011011011101010101101111001110111101100100010101000010111100111101000101000110111101011100101001000100111101101100100011011010011100101100111111001111111001110101101011110110010001110100010110111011011100111101101011011011011101010101101111001110111101100100010101000010111100111101000101000110111101011100101001000100111101101100100011011010011100101100111111001111111001110101101011110110010001110100010110111011001000010 e7b5b6eab79dec8a85e7a28deb9489ed91b4e59f9fceb5ec8e8b76e7b5b6eab79dec8a85e7a28deb9489ed91b4e59f9fceb5ec8e8b7642
UHC 絶귝슅碍딉푴域ε쎋v絶귝슅碍딉푴域ε쎋vB 111011111011111010000010111001101001101010010111111001001111010010001010111011111011111010000010111001101011010010100101111001011001101110110011011101101110111110111110100000101110011010011010100101111110010011110100100010101110111110111110100000101110011010110100101001011110010110011011101100110111011001000010 efbe82e69a97e4f48aefbe82e6b4a5e59bb376efbe82e69a97e4f48aefbe82e6b4a5e59bb37642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)