To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??淫???????筌??淫???????B 111000101010001100111111001111111000100011111010001111110011111100111111001111110011111100111111001111111110001010100011001111110011111110001000111110100011111100111111001111110011111100111111001111110011111101000010 e2a33f3f88fa3f3f3f3f3f3f3fe2a33f3f88fa3f3f3f3f3f3f3f42
EUC-JP 筌??淫???????筌??淫???????B 111001001010010100111111001111111011000011111100001111110011111100111111001111110011111100111111001111111110010010100101001111110011111110110000111111000011111100111111001111110011111100111111001111110011111101000010 e4a53f3fb0fc3f3f3f3f3f3f3fe4a53f3fb0fc3f3f3f3f3f3f3f42
UTF-8 筌뗫럭淫몌쭓琉우죰吏퉣筌뗫럭淫몌쭓琉우죰吏퉣B 11100111101011011000110011101011100101111010101111101011100111111010110111100110101101111010101111101011101010101000110011101100101011011001001111101111101001111000110011101100100110101011000011101100101000111011000011101111101001111001111011101101100010011010001111100111101011011000110011101011100101111010101111101011100111111010110111100110101101111010101111101011101010101000110011101100101011011001001111101111101001111000110011101100100110101011000011101100101000111011000011101111101001111001111011101101100010011010001101000010 e7ad8ceb97abeb9fade6b7abebaa8cecad93efa78cec9ab0eca3b0efa79eed89a3e7ad8ceb97abeb9fade6b7abebaa8cecad93efa78cec9ab0eca3b0efa79eed89a342
UHC 筌뗫럭淫몌쭓琉우죰吏퉣筌뗫럭淫몌쭓琉우죰吏퉣B 111011111010011110001011111010111011011110110000111010111110001010111000111011111010011110001011111010111010010010111111111011001010000110001011111011001010011110111001011101101110111110100111100010111110101110110111101100001110101111100010101110001110111110100111100010111110101110100100101111111110110010100001100010111110110010100111101110010111011001000010 efa78bebb7b0ebe2b8efa78beba4bfeca18beca7b976efa78bebb7b0ebe2b8efa78beba4bfeca18beca7b97642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)