To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 域??染?????D域??染?????D^ 10001000111001100011111100111111100100001111010100111111001111110011111100111111001111110100010010001000111001100011111100111111100100001111010100111111001111110011111100111111001111110100010001011110 88e63f3f90f53f3f3f3f3f4488e63f3f90f53f3f3f3f3f445e
EUC-JP 域??染?????D域??染?????D^ 10110000111010000011111100111111110000001111011100111111001111110011111100111111001111110100010010110000111010000011111100111111110000001111011100111111001111110011111100111111001111110100010001011110 b0e83f3fc0f73f3f3f3f3f44b0e83f3fc0f73f3f3f3f3f445e
UTF-8 域⑶뒔染숁누僚묋랜D域⑶뒔染숁누僚묋랜D^ 111001011001111110011111111000101001000110110110111010111001001010010100111001101001111110010011111011001000100010000001111010111000100010000100111011111010011010111011111010111010110010001011111010111001111010011100010001001110010110011111100111111110001010010001101101101110101110010010100101001110011010011111100100111110110010001000100000011110101110001000100001001110111110100110101110111110101110101100100010111110101110011110100111000100010001011110 e59f9fe291b6eb9294e69f93ec8881eb8884efa6bbebac8beb9e9c44e59f9fe291b6eb9294e69f93ec8881eb8884efa6bbebac8beb9e9c445e
UHC 域⑶뒔染숁누僚묋랜D域⑶뒔染숁누僚묋랜D^ 111001101011010010101001111010011000101010010001111001101111100010011001111001101011010010101001111010001110100010010001111010001011011110100011010001001110011010110100101010011110100110001010100100011110011011111000100110011110011010110100101010011110100011101000100100011110100010110111101000110100010001011110 e6b4a9e98a91e6f899e6b4a9e8e891e8b7a344e6b4a9e98a91e6f899e6b4a9e8e891e8b7a3445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)