To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??循??揄??癲??夷??猷??域??膺? 11100010101000110011111100111111100011110111101000111111001111111001110110001001001111110011111111100001100111110011111100111111100010001100111000111111001111111001011101010001001111110011111110001000111001100011111100111111111001000101111000111111 e2a33f3f8f7a3f3f9d893f3fe19f3f3f88ce3f3f97513f3f88e63f3fe45e3f
EUC-JP 筌??循??揄??癲??夷??猷??域??膺? 11100100101001010011111100111111101111011101101100111111001111111101100111101001001111110011111111100010101000010011111100111111101100001101000000111111001111111100110110110010001111110011111110110000111010000011111100111111111001111011111100111111 e4a53f3fbddb3f3fd9e93f3fe2a13f3fb0d03f3fcdb23f3fb0e83f3fe7bf3f
UTF-8 筌뚮뿨循룝퓠揄쎈룚癲욧퀡夷긺뛾猷⑸쐡域듭쥏膺놟 111001111010110110001100111010111001101010101110111010111011111110101000111001011011111010101010111010111010001110011101111011011001001110100000111001101000111110000100111011001000111010001000111010111010001110011010111001111001100110110010111011001001101010100111111011011000000010100001111001011010010010110111111010101011100010111010111010111001101110111110111001111000110010110111111000101001000110111000111011001001000010100001111001011001111110011111111010111001001110101101111011001010010110001111111010001000011010111010111010111000011010011111 e7ad8ceb9aaeebbfa8e5beaaeba39ded93a0e68f84ec8e88eba39ae799b2ec9aa7ed80a1e5a4b7eab8baeb9bbee78cb7e291b8ec90a1e59f9feb93adeca58fe886baeb869f
UHC 筌뚮뿨循룝퓠揄쎈룚癲욧퀡夷긺뛾猷⑸쐡域듭쥏膺놟 11101111101001111000110011101011100101111010100011100010111000001011011111100100101111111000100111101010111100011011110111101011100011111001011011101111101001101011111111101010101100111001010111101100101010001011000111100111100011011000010011101011101000111010100111101011100111001000011111100110101101001011010111101100101000101000100011101011111011001000011101000010 efa78ceb97a8e2e0b7e4bf89eaf1bdeb8f96efa6bfeab395eca8b1e78d84eba3a9eb9c87e6b4b5eca288ebec8742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)