To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 域??曄?????D域??曄?????D^ 10001000111001100011111100111111100111100100000000111111001111110011111100111111001111110100010010001000111001100011111100111111100111100100000000111111001111110011111100111111001111110100010001011110 88e63f3f9e403f3f3f3f3f4488e63f3f9e403f3f3f3f3f445e
EUC-JP 域??曄?????D域??曄?????D^ 10110000111010000011111100111111110110111010000100111111001111110011111100111111001111110100010010110000111010000011111100111111110110111010000100111111001111110011111100111111001111110100010001011110 b0e83f3fdba13f3f3f3f3f44b0e83f3fdba13f3f3f3f3f445e
UTF-8 域⑶뒔曄쀦누僚묋랜D域⑶뒔曄쀦누僚묋랜D^ 111001011001111110011111111000101001000110110110111010111001001010010100111001101001101110000100111011001000000010100110111010111000100010000100111011111010011010111011111010111010110010001011111010111001111010011100010001001110010110011111100111111110001010010001101101101110101110010010100101001110011010011011100001001110110010000000101001101110101110001000100001001110111110100110101110111110101110101100100010111110101110011110100111000100010001011110 e59f9fe291b6eb9294e69b84ec80a6eb8884efa6bbebac8beb9e9c44e59f9fe291b6eb9294e69b84ec80a6eb8884efa6bbebac8beb9e9c445e
UHC 域⑶뒔曄쀦누僚묋랜D域⑶뒔曄쀦누僚묋랜D^ 111001101011010010101001111010011000101010010001111001111010010110010111111001101011010010101001111010001110100010010001111010001011011110100011010001001110011010110100101010011110100110001010100100011110011110100101100101111110011010110100101010011110100011101000100100011110100010110111101000110100010001011110 e6b4a9e98a91e7a597e6b4a9e8e891e8b7a344e6b4a9e98a91e7a597e6b4a9e8e891e8b7a3445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)