To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????v[????v????v[^ 0011111100111111001111110011111101110110001111110011111100111111001111110111011001011011001111110011111100111111001111110111011000111111001111110011111100111111011101100101101101011110 3f3f3f3f763f3f3f3f765b3f3f3f3f763f3f3f3f765b5e
SJIS-WIN 贈???v贈???v[贈???v贈???v[^ 100100011010000100111111001111110011111101110110100100011010000100111111001111110011111101110110010110111001000110100001001111110011111100111111011101101001000110100001001111110011111100111111011101100101101101011110 91a13f3f3f7691a13f3f3f765b91a13f3f3f7691a13f3f3f765b5e
EUC-JP 贈???v贈???v[贈???v贈???v[^ 110000101010001100111111001111110011111101110110110000101010001100111111001111110011111101110110010110111100001010100011001111110011111100111111011101101100001010100011001111110011111100111111011101100101101101011110 c2a33f3f3f76c2a33f3f3f765bc2a33f3f3f76c2a33f3f3f765b5e
UTF-8 贈숄렰렟v贈숄렰렟v[贈숄렰렟v贈숄렰렟v[^ 11101000101101001000100011101100100010001000010011101011101000001011000011101011101000001001111101110110111010001011010010001000111011001000100010000100111010111010000010110000111010111010000010011111011101100101101111101000101101001000100011101100100010001000010011101011101000001011000011101011101000001001111101110110111010001011010010001000111011001000100010000100111010111010000010110000111010111010000010011111011101100101101101011110 e8b488ec8884eba0b0eba09f76e8b488ec8884eba0b0eba09f765be8b488ec8884eba0b0eba09f76e8b488ec8884eba0b0eba09f765b5e
UHC 贈숄렰렟v贈숄렰렟v[贈숄렰렟v贈숄렰렟v[^ 111100011111110010111100111100011000111010111101100011101011000001110110111100011111110010111100111100011000111010111101100011101011000001110110010110111111000111111100101111001111000110001110101111011000111010110000011101101111000111111100101111001111000110001110101111011000111010110000011101100101101101011110 f1fcbcf18ebd8eb076f1fcbcf18ebd8eb0765bf1fcbcf18ebd8eb076f1fcbcf18ebd8eb0765b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)