What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 五γ????楡り?慂щ????純??孃 100011001101110010000011110000010011111100111111001111110011111110011110101111101000001011101000001111111001110011001000100001001000101100111111001111110011111100111111100011111000001100111111001111111001101101101111 8cdc83c13f3f3f3f9ebe82e83f9cc8848b3f3f3f3f8f833f3f9b6f
EUC-JP 五γ?佾??楡り?慂щ?嫄??純??孃 10111000110111101010011011000011001111111000111110110000111110110011111100111111110111001100000010100100111010100011111111011000110010101010011111101011001111111000111110111010101000010011111100111111101111011110001100111111001111111101010111010000 b8dea6c33f8fb0fb3f3fdcc0a4ea3fd8caa7eb3f8fbaa13f3fbde33f3fd5d0
UTF-8 五γ룢佾붷푻楡り탾慂щ겧嫄싪떏純쏇뜑孃 11100100101110101001010011001110101100111110101110100011101000101110010010111101101111101110101110110110101101111110110110010001101110111110011010100101101000011110001110000010100010101110110110000011101111101110011010000101100000101101000110001001111010101011001010100111111001011010101110000100111011001000101110101010111010111001011010001111111001111011010010010100111011001000111110000111111010111001110010010001111001011010110110000011 e4ba94ceb3eba3a2e4bdbeebb6b7ed91bbe6a5a1e3828aed83bee68582d189eab2a7e5ab84ec8baaeb968fe7b494ec8f87eb9c91e5ad83
UHC 五γ룢佾붷푻楡り탾慂щ겧嫄싪떏純쏇뜑孃 1110011111101001101001011110001110001111100110111110110011101011100101001110010110111110100001111110101011111000101010101110101010110101100110101110100110111101101011001110101110000001101110011110101010110001100110101110100010001011101001011110001011101101100110111110110110001101100101001110010110111110 e7e9a5e38f9beceb94e5be87eaf8aaeab59ae9bdaceb81b9eab19ae88ba5e2ed9bed8d94e5be

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
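
A table like the one above can be approximated offline with a few lines of Python. This is a minimal sketch, not the tool's actual implementation. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to be Python's closest equivalents of the charsets listed, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in encode_table("五γ").items():
        print(name, bits, hexstr)
```

For the input `五γ`, the UTF-8 row of this sketch gives the hex string `e4ba94ceb3`, which matches the first five bytes of the UTF-8 row in the table.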