To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 贈?????祭蛟??贈?????祭蛟??^ 100100011010000100111111001111110011111100111111001111111000110111010101111001011000000000111111001111111001000110100001001111110011111100111111001111110011111110001101110101011110010110000000001111110011111101011110 91a13f3f3f3f3f8dd5e5803f3f91a13f3f3f3f3f8dd5e5803f3f5e
EUC-JP 贈???孼?祭蛟??贈???孼?祭蛟??^ 11000010101000110011111100111111001111111000111110111010110000110011111110111010110101111110100111100000001111110011111111000010101000110011111100111111001111111000111110111010110000110011111110111010110101111110100111100000001111110011111101011110 c2a33f3f3f8fbac33fbad7e9e03f3fc2a33f3f3f8fbac33fbad7e9e03f3f5e
UTF-8 贈븀렪렧孼렱祭蛟렰렓贈븀렪렧孼렱祭蛟렰렓^ 11101000101101001000100011101011101110001000000011101011101000001010101011101011101000001010011111100101101011011011110011101011101000001011000111100111101001011010110111101000100110111001111111101011101000001011000011101011101000001001001111101000101101001000100011101011101110001000000011101011101000001010101011101011101000001010011111100101101011011011110011101011101000001011000111100111101001011010110111101000100110111001111111101011101000001011000011101011101000001001001101011110 e8b488ebb880eba0aaeba0a7e5adbceba0b1e7a5ade89b9feba0b0eba093e8b488ebb880eba0aaeba0a7e5adbceba0b1e7a5ade89b9feba0b0eba0935e
UHC 贈븀렪렧孼렱祭蛟렰렓贈븀렪렧孼렱祭蛟렰렓^ 1111000111111100101110101110011110001110101110001000111010110110111001011110110110001110101111101111000010101110110011101111000110001110101111011000111010101000111100011111110010111010111001111000111010111000100011101011011011100101111011011000111010111110111100001010111011001110111100011000111010111101100011101010100001011110 f1fcbae78eb88eb6e5ed8ebef0aecef18ebd8ea8f1fcbae78eb88eb6e5ed8ebef0aecef18ebd8ea85e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
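
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumed to be the closest Python equivalents, and the function name encode_table is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be how the table above handles them. Mapping tables may differ slightly between implementations, so individual bytes may not match the table in every case.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("\u8d08^"):  # U+8D08 followed by "^"
        print(label, bits, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.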