What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 šh捁{šh譖€}išh捁{šh譖€}iB 1001101001101000111001101000110110000001011110111001101001101000111010001010110110010110100000000111110101101001100110100110100011100110100011011000000101111011100110100110100011101000101011011001011010000000011111010110100101000010 9a68e68d817b9a68e8ad96807d699a68e68d817b9a68e8ad96807d6942
SJIS-WIN ?h???{?h????}i?h???{?h????}iB 0011111101101000001111110011111100111111011110110011111101101000001111110011111100111111001111110111110101101001001111110110100000111111001111110011111101111011001111110110100000111111001111110011111100111111011111010110100101000010 3f683f3f3f7b3f683f3f3f3f7d693f683f3f3f7b3f683f3f3f3f7d6942
EUC-JP ?hæ??{?hè???}i?hæ??{?hè???}iB 00111111011010001000111110101001110000010011111100111111011110110011111101101000100011111010101110110010001111110011111100111111011111010110100100111111011010001000111110101001110000010011111100111111011110110011111101101000100011111010101110110010001111110011111100111111011111010110100101000010 3f688fa9c13f3f7b3f688fabb23f3f3f7d693f688fa9c13f3f7b3f688fabb23f3f3f7d6942
UTF-8 šh捁{šh譖€}išh捁{šh譖€}iB 1100001010011010011010001100001110100110110000101000110111000010100000010111101111000010100110100110100011000011101010001100001010101101110000101001011011000010100000000111110101101001110000101001101001101000110000111010011011000010100011011100001010000001011110111100001010011010011010001100001110101000110000101010110111000010100101101100001010000000011111010110100101000010 c29a68c3a6c28dc2817bc29a68c3a8c2adc296c2807d69c29a68c3a6c28dc2817bc29a68c3a8c2adc296c2807d6942
UHC ?hæ??{?h?­??}i?hæ??{?h?­??}iB 001111110110100010101001101000010011111100111111011110110011111101101000001111111010000110101001001111110011111101111101011010010011111101101000101010011010000100111111001111110111101100111111011010000011111110100001101010010011111100111111011111010110100101000010 3f68a9a13f3f7b3f683fa1a93f3f7d693f68a9a13f3f7b3f683fa1a93f3f7d6942

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
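
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the function name `to_bitstrings` and the `CODECS` mapping are hypothetical, and the Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to be close equivalents of the charsets listed above. Their mapping tables may differ in details from the ones this page uses. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the "3f" bytes in the table arise.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # example input; any short string works
    for label, codec in CODECS.items():
        bits, hexa = to_bitstrings(sample, codec)
        print(label, bits, hexa)
```

For example, `to_bitstrings("A", "utf-8")` returns `("01000001", "41")`, and the same character encoded as ISO-8859-1 gives the same result because both charsets agree on the ASCII range.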