To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???▼?淫??筌??逾??儒??沃 1110000110011111001111110011111100111111100000011010010100111111100010001111101000111111001111111110001010100011001111110011111111100111101001010011111100111111100011101111001000111111001111111001011110000000 e19f3f3f3f81a53f88fa3f3fe2a33f3fe7a53f3f8ef23f3f9780
EUC-JP 癲??沅▼?淫??筌??逾??儒??沃 11100010101000010011111100111111100011111100011011101001101000101010011100111111101100001111110000111111001111111110010010100101001111110011111111101110101001110011111100111111101111001111010000111111001111111100110111100000 e2a13f3f8fc6e9a2a73fb0fc3f3fe4a53f3feea73f3fbcf43f3fcde0
UTF-8 癲됱떝沅▼뵱淫놃뮍筌뚭풜逾뽫뫀儒좏돫沃 111001111001100110110010111010111001000010110001111010111001011010011101111001101011001010000101111000101001011010111100111010111011010110110001111001101011011110101011111010111000011010000011111010111010111010001101111001111010110110001100111010111001101010101101111011011001001010011100111010011000000010111110111010111011110110101011111010111010101110000000111001011000010010010010111011001010001010001111111010111000111110101011111001101011001010000011 e799b2eb90b1eb969de6b285e296bcebb5b1e6b7abeb8683ebae8de7ad8ceb9aaded929ce980beebbdabebab80e58492eca28feb8fabe6b283
UHC 癲됱떝沅▼뵱淫놃뮍筌뚭풜逾뽫뫀儒좏돫沃 1110111110100110100010011110110010001011101100111110101010110110101000011110010110010100101011111110101111100010100001101110110110010010100110101110111110100111100011001110101010111110100111111110101110110101100101101110011110010001101001001110101011100011101000001110110110001001101011101110100010101010 efa689ec8bb3eab6a1e594afebe286ed929aefa78ceabe9febb596e791a4eae3a0ed89aee8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)