What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??而??儀??筌??逾??蟻??筌?? 11100010101000110011111100111111100011101010011100111111001111111000101101010110001111110011111111100010101000110011111100111111111001111010010100111111001111111000101101100001001111110011111111100010101000110011111100111111 e2a33f3f8ea73f3f8b563f3fe2a33f3fe7a53f3f8b613f3fe2a33f3f
EUC-JP 筌??而??儀??筌??逾??蟻??筌?? 11100100101001010011111100111111101111001010100100111111001111111011010110110111001111110011111111100100101001010011111100111111111011101010011100111111001111111011010111000010001111110011111111100100101001010011111100111111 e4a53f3fbca93f3fb5b73f3fe4a53f3feea73f3fb5c23f3fe4a53f3f
UTF-8 筌뗭궏而⒵썣儀뺤젴筌뗭궍逾쒙쭓蟻숈죳筌뗫썔 111001111010110110001100111010111001011110101101111010101011011010001111111010001000000010001100111000101001001010110101111011001000110110100011111001011000010010000000111010111011101010100100111011001010000010110100111001111010110110001100111010111001011110101101111010101011011010001101111010011000000010111110111011001001001010011001111011001010110110010011111010001001111110111011111011001000100010001000111011001010001110110011111001111010110110001100111010111001011110101011111011001000110110010100 e7ad8ceb97adeab68fe8808ce292b5ec8da3e58480ebbaa4eca0b4e7ad8ceb97adeab68de980beec9299ecad93e89fbbec8888eca3b3e7ad8ceb97abec8d94
UHC 筌뗭궏而⒵썣儀뺤젴筌뗭궍逾쒙쭓蟻숈죳筌뗫썔 111011111010011110001011111011001000001010100101111011001011101110101001111001101001101110010110111010111111000010010101111011001010000010101000111011111010011110001011111011001000001010100011111010111011010110011100111011111010011110001011111010111111110010011001111011001010000110001110111011111010011110001011111010111001101110000111 efa78bec82a5ecbba9e69b96ebf095eca0a8efa78bec82a3ebb59cefa78bebfc99eca18eefa78beb9b87

SJIS-Win, EUC-JP: Legacy charsets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
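
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the code behind this page: the helper names `to_bits` and `convert` are made up for illustration, and the mapping from the charset labels above to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which would explain the runs of 3f in the table above.

```python
# Hypothetical mapping from the charset labels used in the table to
# Python codec names. "cp932" for SJIS-WIN and "cp949" for UHC are assumptions.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("あ").items():
        print(label, binary, hexa)
```

For example, `convert("あ")["UTF-8"]` gives the hex string `e38182`, while `convert("あ")["ISO-8859-1"]` gives `3f` because ISO-8859-1 has no hiragana.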