To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 êþ±ë šçúˆì„žêþ±ë ŽëãQB 1110101011111110101100011110101110100000100110101110011111111010100010001110110010000100100111101110101011111110101100011110101110100000100011101110101111100011100011010101000101000010 eafeb1eba09ae7fa88ec849eeafeb1eba08eebe38d5142
SJIS-WIN ??±???????????±??????QB 00111111001111111000000101111101001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110000001011111010011111100111111001111110011111100111111001111110101000101000010 3f3f817d3f3f3f3f3f3f3f3f3f3f3f817d3f3f3f3f3f3f5142
EUC-JP êþ±ë??çú?ì??êþ±ë??ëã?QB 1000111110101011101101001000111110101001110100001010000111011110100011111010101110110011001111110011111110001111101010111010111010001111101010111110001000111111100011111010101111000000001111110011111110001111101010111011010010001111101010011101000010100001110111101000111110101011101100110011111100111111100011111010101110110011100011111010101110101010001111110101000101000010 8fabb48fa9d0a1de8fabb33f3f8fabae8fabe23f8fabc03f3f8fabb48fa9d0a1de8fabb33f3f8fabb38fabaa3f5142
UTF-8 êþ±ë šçúˆì„žêþ±ë ŽëãQB 1100001110101010110000111011111011000010101100011100001110101011110000101010000011000010100110101100001110100111110000111011101011000010100010001100001110101100110000101000010011000010100111101100001110101010110000111011111011000010101100011100001110101011110000101010000011000010100011101100001110101011110000111010001111000010100011010101000101000010 c3aac3bec2b1c3abc2a0c29ac3a7c3bac288c3acc284c29ec3aac3bec2b1c3abc2a0c28ec3abc3a3c28d5142
UHC ?þ±??????????þ±??????QB 001111111010100110101101101000011011111000111111001111110011111100111111001111110011111100111111001111110011111100111111101010011010110110100001101111100011111100111111001111110011111100111111001111110101000101000010 3fa9ada1be3f3f3f3f3f3f3f3f3f3fa9ada1be3f3f3f3f3f3f5142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)