To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔?縡??絅???絅?兢??伊逗?源??絅?臼 100100101010001000111111111000110111000100111111001111111110001101000100001111110011111100111111111000110100010000111111100110010101110100111111001111111000100011001001100100001000000000111111100011001011100100111111001111111110001101000100001111111000100101010000 92a23fe3713f3fe3443f3f3fe3443f995d3f3f88c990803f8cb93f3fe3443f8950
EUC-JP 弔?縡?饔絅??饔絅?兢??伊逗汶源?饔絅?臼 1100010010100100001111111110010111010010001111111000111111101000111011111110010110100101001111110011111110001111111010001110111111100101101001010011111111010001101111100011111100111111101100001100101110111111111000001000111111000110111001011011100010111011001111111000111111101000111011111110010110100101001111111011000110110001 c4a43fe5d23f8fe8efe5a53f3f8fe8efe5a53fd1be3f3fb0cbbfe08fc6e5b8bb3f8fe8efe5a53fb1b1
UTF-8 弔렟縡렕饔絅렩렰饔絅렚兢렩렰伊逗汶源렰饔絅렚臼 111001011011110010010100111010111010000010011111111001111011100010100001111010111010000010010101111010011010010110010100111001111011010110000101111010111010000010101001111010111010000010110000111010011010010110010100111001111011010110000101111010111010000010011010111001011000010110100010111010111010000010101001111010111010000010110000111001001011110010001010111010011000000010010111111001101011000110110110111001101011101010010000111010111010000010110000111010011010010110010100111001111011010110000101111010111010000010011010111010001000011110111100 e5bc94eba09fe7b8a1eba095e9a594e7b585eba0a9eba0b0e9a594e7b585eba09ae585a2eba0a9eba0b0e4bc8ae98097e6b1b6e6ba90eba0b0e9a594e7b585eba09ae887bc
UHC 弔렟縡렕饔絅렩렰饔絅렚兢렩렰伊逗汶源렰饔絅렚臼 11110000110000001000111010110000111011101010110110001110101010101110100010111101110011001110011110001110101101111000111010111101111010001011110111001100111001111000111010101101110100001110011110001110101101111000111010111101111011001010010111010100111010001101101010100001111010101011100110001110101111011110100010111101110011001110011110001110101011011100111110111111 f0c08eb0eead8eaae8bdcce78eb78ebde8bdcce78eadd0e78eb78ebdeca5d4e8daa1eab98ebde8bdcce78eadcfbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)