To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 蒻??節??節??厄ε?埃??檍??熬ュ?B 11100100111010000011111100111111100100001101111100111111001111111001000011011111001111110011111110010110111011111000001111000011001111111001101010111010001111110011111110011110111110000011111100111111111000001001001010000011100001010011111101000010 e4e83f3f90df3f3f90df3f3f96ef83c33f9aba3f3f9ef83f3fe09283853f42
EUC-JP 蒻??節??節?˙厄ε?埃??檍??熬ュ?B 111010001110101000111111001111111100000011100001001111110011111111000000111000010011111110001111101000101011001011001100111100011010011011000101001111111101010010111100001111110011111111011100111110100011111100111111110111111111001010100101111001010011111101000010 e8ea3f3fc0e13f3fc0e13f8fa2b2ccf1a6c53fd4bc3f3fdcfa3f3fdff2a5e53f42
UTF-8 蒻멨뫕節꿩뇥節김˙厄ε떳埃뤄쉿檍랃쉽熬ュ슖B 1110100010010010101110111110101110101001101010001110101110101011100101011110011110101111100000001110101010111111101010011110101110000111101001011110011110101111100000001110101010111001100000001100101110011001111001011000111010000100110011101011010111101011100101101011001111100101100111111000001111101011101001001000010011101100100010011011111111100110101010101000110111101011100111101000001111101100100010011011110111100111100001101010110011100011100000111010010111101100100010101001011001000010 e892bbeba9a8ebab95e7af80eabfa9eb87a5e7af80eab980cb99e58e84ceb5eb96b3e59f83eba484ec89bfe6aa8deb9e83ec89bde786ace383a5ec8a9642
UHC 蒻멨뫕節꿩뇥節김˙厄ε떳埃뤄쉿檍랃쉽熬ュ슖B 11100101101101101011100011100101100100011011011111101111101111011011001011100110100001111000110111101111101111011011000111101000101000101010101111100100111110001010010111100101101101101011100011100100111011111011011111101111101111011011001011100101111001011000110111101111101111011011000111101000101000101010101111100101100110101010010101000010 e5b6b8e591b7efbdb2e6878defbdb1e8a2abe4f8a5e5b6b8e4efb7efbdb2e5e58defbdb1e8a2abe59aa542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)