To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??爰??受??娃??徇??音≪?? 1110000110011111001111110011111111100000101001110011111100111111100011101111001100111111001111111000100010100001001111110011111110011100011011010011111100111111100010011011100110000001111000010011111100111111 e19f3f3fe0a73f3f8ef33f3f88a13f3f9c6d3f3f89b981e13f3f
EUC-JP 癲??爰??受??娃??徇??音≪?彛 11100010101000010011111100111111111000001010100100111111001111111011110011110101001111110011111110110000101000110011111100111111110101111100111000111111001111111011001010111011101000101110001100111111100011111011110011111010 e2a13f3fe0a93f3fbcf53f3fb0a33f3fd7ce3f3fb2bba2e33f8fbcfa
UTF-8 癲ㅺ퓭爰귝끽受쎌뒴娃븍툖徇띺샒音≪궡彛 111001111001100110110010111000111000010110111010111011011001001110101101111001111000100010110000111010101011011110011101111010111000000110111101111001011000111110010111111011001000111010001100111010111001001010110100111001011010100010000011111010111011100010001101111011011000100010010110111001011011111010000111111010111001110110111010111011001000001110010010111010011001111110110011111000101000100110101010111010101011011010100001111001011011110110011011 e799b2e385baed93ade788b0eab79deb81bde58f97ec8e8ceb92b4e5a883ebb88ded8896e5be87eb9dbaec8392e99fb3e289aaeab6a1e5bd9b
UHC 癲ㅺ퓭爰귝끽受쎌뒴娃븍툖徇띺샒音≪궡彛 1110111110100110101001001110101010111111100101001110101010111010100000101110011010110011101000111110000111110100101111011110110010001010101011011110100011011111101110101110101110111000100011011110001011011111100011011110100110011000101111111110101111100101101000011110110010000010101101001110110010101101 efa6a4eabf94eaba82e6b3a3e1f4bdec8aade8dfbaebb88de2df8de998bfebe5a1ec82b4ecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)