Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰?????淫?????異??碎k?沃??異 1000100110000001001111110011111100111111001111110011111110001000111110100011111100111111001111110011111100111111100010001101100100111111001111111110000111101010100000101000101100111111100101111000000000111111001111111000100011011001 89813f3f3f3f3f88fa3f3f3f3f3f88d93f3fe1ea828b3f97803f3f88d9
EUC-JP 堰?????淫?????異??碎k?沃??異 1011000111100001001111110011111100111111001111110011111110110000111111000011111100111111001111110011111100111111101100001101101100111111001111111110001011101100101000111110101100111111110011011110000000111111001111111011000011011011 b1e13f3f3f3f3fb0fc3f3f3f3f3fb0db3f3fe2eca3eb3fcde03f3fb0db
UTF-8 堰묐쓷流쒎렘淫볝룄亮쎈슣異녘첑碎k궖沃띾ㅉ異 111001011010000010110000111010111010110010010000111011001001001110110111111011111010011110001010111011001001001010001110111010111010000010011000111001101011011110101011111010111011001110011101111010111010001110000100111011111010010110110111111011001000111010001000111011001000101010100011111001111001010110110000111010111000010110011000111011001011001010010001111001111010001010001110111011111011110110001011111010101011011010010110111001101011001010000011111010111001110110111110111000111000010110001001111001111001010110110000 e5a0b0ebac90ec93b7efa78aec928eeba098e6b7abebb39deba384efa5b7ec8e88ec8aa3e795b0eb8598ecb291e7a28eefbd8beab696e6b283eb9dbee38589e795b0
UHC 堰묐쓷流쒎렘淫볝룄亮쎈슣異녘첑碎k궖沃띾ㅉ異 1110010111101000100100011110101110011101100101001110101011111100100111001110010110110111101111011110101111100010100100111110001110001111100001001110010110111001101111011110101110011010101011111110110010110110101100111110100010101010100111101110000111101111101000111110101110000010101010111110100010101010100011011110101110100100101110011110110010110110 e5e891eb9d94eafc9ce5b7bdebe293e38f84e5b9bdeb9aafecb6b3e8aa9ee1efa3eb82abe8aa8deba4b9ecb6

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
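
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation, which is not shown on this page. The function name `encode_in_charsets` and the `CHARSETS` mapping are hypothetical, and the choice of Python's `cp932` codec for SJIS-WIN and `cp949` for UHC is an assumption about which mappings the tool uses. Characters that a charset cannot represent are replaced with `?` (0x3f) here, which would be consistent with the runs of `3f` in the table.

```python
# Table label -> Python codec name. The codec choices are assumptions:
# cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (binary bit string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(format(byte, "08b") for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_in_charsets("堰").items():
        print(label, bits, hexa)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column for the same input.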