To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 五γ?泣?Р酉??i五γ?泣?Р酉??iB 10001100110111001000001111000001001111111000101110000011001111111000010001010001100100111101000100111111001111110110100110001100110111001000001111000001001111111000101110000011001111111000010001010001100100111101000100111111001111110110100101000010 8cdc83c13f8b833f845193d13f3f698cdc83c13f8b833f845193d13f3f6942
EUC-JP 五γ?泣?Р酉??i五γ?泣?Р酉??iB 10111000110111101010011011000011001111111011010111100011001111111010011110110010110001101101001100111111001111110110100110111000110111101010011011000011001111111011010111100011001111111010011110110010110001101101001100111111001111110110100101000010 b8dea6c33fb5e33fa7b2c6d33f3f69b8dea6c33fb5e33fa7b2c6d33f3f6942
UTF-8 五γ룇泣먩Р酉귥돽i五γ룇泣먩Р酉귥돽iB 1110010010111010100101001100111010110011111010111010001110000111111001101011001110100011111010111010100010101001110100001010000011101001100001011000100111101010101101111010010111101011100011111011110101101001111001001011101010010100110011101011001111101011101000111000011111100110101100111010001111101011101010001010100111010000101000001110100110000101100010011110101010110111101001011110101110001111101111010110100101000010 e4ba94ceb3eba387e6b3a3eba8a9d0a0e98589eab7a5eb8fbd69e4ba94ceb3eba387e6b3a3eba8a9d0a0e98589eab7a5eb8fbd6942
UHC 五γ룇泣먩Р酉귥돽i五γ룇泣먩Р酉귥돽iB 111001111110100110100101111000111000111110000110111010111110100010010000111001101010110010110010111010111011011110000010111011001000100110111111011010011110011111101001101001011110001110001111100001101110101111101000100100001110011010101100101100101110101110110111100000101110110010001001101111110110100101000010 e7e9a5e38f86ebe890e6acb2ebb782ec89bf69e7e9a5e38f86ebe890e6acb2ebb782ec89bf6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)