To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 筌≪?泣?????i筌≪?泣?????iB 111000101010001110000001111000010011111110001011100000110011111100111111001111110011111100111111011010011110001010100011100000011110000100111111100010111000001100111111001111110011111100111111001111110110100101000010 e2a381e13f8b833f3f3f3f3f69e2a381e13f8b833f3f3f3f3f6942
EUC-JP 筌≪?泣?????i筌≪?泣?????iB 111001001010010110100010111000110011111110110101111000110011111100111111001111110011111100111111011010011110010010100101101000101110001100111111101101011110001100111111001111110011111100111111001111110110100101000010 e4a5a2e33fb5e33f3f3f3f3f69e4a5a2e33fb5e33f3f3f3f3f6942
UTF-8 筌≪눛泣꾬㎖類ㅺ퍥i筌≪눛泣꾬㎖類ㅺ퍥iB 111001111010110110001100111000101000100110101010111010111000100010011011111001101011001110100011111010101011111010101100111000111000111010010110111011111010011110010000111000111000010110111010111011011000110110100101011010011110011110101101100011001110001010001001101010101110101110001000100110111110011010110011101000111110101010111110101011001110001110001110100101101110111110100111100100001110001110000101101110101110110110001101101001010110100101000010 e7ad8ce289aaeb889be6b3a3eabeace38e96efa790e385baed8da569e7ad8ce289aaeb889be6b3a3eabeace38e96efa790e385baed8da56942
UHC 筌≪눛泣꾬㎖類ㅺ퍥i筌≪눛泣꾬㎖類ㅺ퍥iB 111011111010011110100001111011001000011110110011111010111110100010000100111011111010011110100010111010111011101010100100111010101011101110011100011010011110111110100111101000011110110010000111101100111110101111101000100001001110111110100111101000101110101110111010101001001110101010111011100111000110100101000010 efa7a1ec87b3ebe884efa7a2ebbaa4eabb9c69efa7a1ec87b3ebe884efa7a2ebbaa4eabb9c6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)