To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??溢g?兪??阿??艤??節??泳 111000011001111100111111001111111000100011101100100000101000011100111111100110010110000000111111001111111000100010100010001111110011111111100100011111100011111100111111100100001101111100111111001111111000100101101010 e19f3f3f88ec82873f99603f3f88a23f3fe47e3f3f90df3f3f896a
EUC-JP 癲??溢g?兪??阿??艤??節??泳 111000101010000100111111001111111011000011101110101000111110011100111111110100011100000100111111001111111011000010100100001111110011111111100111110111110011111100111111110000001110000100111111001111111011000111001011 e2a13f3fb0eea3e73fd1c13f3fb0a43f3fe7df3f3fc0e13f3fb1cb
UTF-8 癲뚮씛溢g뙴兪낆춳阿숋퐢艤븃돳節뗫깽泳 111001111001100110110010111010111001101010101110111011001001010010011011111001101011101010100010111011111011110110000111111010111001100110110100111001011000010110101010111010111000001010000110111011001011011010110011111010011001100010111111111011001000100010001011111011011001000010100010111010001000100110100100111010111011100010000011111010111000111110110011111001111010111110000000111010111001011110101011111010101011100110111101111001101011001110110011 e799b2eb9aaeec949be6baa2efbd87eb99b4e585aaeb8286ecb6b3e998bfec888bed90a2e889a4ebb883eb8fb3e7af80eb97abeab9bde6b3b3
UHC 癲뚮씛溢g뙴兪낆춳阿숋퐢艤븃돳節뗫깽泳 1110111110100110100011001110101110011101101100001110110011101110101000111110011110001100101101111110101011100100100001011110110010101101100011111110010010111001100110011110111110111101100010111110101111111010101110101110100010001001101101101110111110111101100010111110101110110010101001001110011110110110 efa68ceb9db0eceea3e78cb7eae485ecad8fe4b999efbd8bebfabae889b6efbd8bebb2a4e7b6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)