What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 哀????????異?哀????????異?B 100010001010001100111111001111110011111100111111001111110011111100111111001111111000100011011001001111111000100010100011001111110011111100111111001111110011111100111111001111110011111110001000110110010011111101000010 88a33f3f3f3f3f3f3f3f88d93f88a33f3f3f3f3f3f3f3f88d93f42
EUC-JP 哀????????異?哀????????異?B 101100001010010100111111001111110011111100111111001111110011111100111111001111111011000011011011001111111011000010100101001111110011111100111111001111110011111100111111001111110011111110110000110110110011111101000010 b0a53f3f3f3f3f3f3f3fb0db3fb0a53f3f3f3f3f3f3f3fb0db3f42
UTF-8 哀잙젦緣욏렖淋끻떀異늲哀잙젦緣욏렖淋끻떀異늲B 11100101100100111000000011101100100111101001100111101100101000001010011011100111101101111010001111101100100110101000111111101011101000001001011011101111101001111011010111101011100000011011101111101011100101101000000011100111100101011011000011101011100010101011001011100101100100111000000011101100100111101001100111101100101000001010011011100111101101111010001111101100100110101000111111101011101000001001011011101111101001111011010111101011100000011011101111101011100101101000000011100111100101011011000011101011100010101011001001000010 e59380ec9e99eca0a6e7b7a3ec9a8feba096efa7b5eb81bbeb9680e795b0eb8ab2e59380ec9e99eca0a6e7b7a3ec9a8feba096efa7b5eb81bbeb9680e795b0eb8ab242
UHC 哀잙젦緣욏렖淋끻떀異늲哀잙젦緣욏렖淋끻떀異늲B 111001001110111010011111111010111010000010011110111001101101111010011110111011011000111010101011111011001111100010000101111001011000101110010110111011001011011010001000011101101110010011101110100111111110101110100000100111101110011011011110100111101110110110001110101010111110110011111000100001011110010110001011100101101110110010110110100010000111011001000010 e4ee9feba09ee6de9eed8eabecf885e58b96ecb68876e4ee9feba09ee6de9eed8eabecf885e58b96ecb6887642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
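
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names `cp932` (for SJIS-Win) and `cp949` (for UHC) are assumed to be the closest Python equivalents, and the helper `encode_row` is a hypothetical name introduced here. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` runs in the table.

```python
# Charset labels from the table mapped to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "哀B"  # hypothetical sample input
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_row(sample, codec)
        print(label, binary, hexadecimal)
```

Each encoded byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.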