To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 畏?????饒??}畏?????饒??{^ 10001000110110000011111100111111001111110011111100111111111010010110000000111111001111110111110110001000110110000011111100111111001111110011111100111111111010010110000000111111001111110111101101011110 88d83f3f3f3f3fe9603f3f7d88d83f3f3f3f3fe9603f3f7b5e
EUC-JP 畏?????饒??}畏?????饒??{^ 10110000110110100011111100111111001111110011111100111111111100011100000100111111001111110111110110110000110110100011111100111111001111110011111100111111111100011100000100111111001111110111101101011110 b0da3f3f3f3f3ff1c13f3f7db0da3f3f3f3f3ff1c13f3f7b5e
UTF-8 畏먫뮅鍊깍풒饒껒뮇}畏먫뮅鍊깍풒饒껒뮇{^ 111001111001010110001111111010111010100010101011111010111010111010000101111011111010011010011011111010101011100110001101111011011001001010010010111010011010010110010010111010101011101110010010111010111010111010000111011111011110011110010101100011111110101110101000101010111110101110101110100001011110111110100110100110111110101010111001100011011110110110010010100100101110100110100101100100101110101010111011100100101110101110101110100001110111101101011110 e7958feba8abebae85efa69beab98ded9292e9a592eabb92ebae877de7958feba8abebae85efa69beab98ded9292e9a592eabb92ebae877b5e
UHC 畏먫뮅鍊깍풒饒껒뮇}畏먫뮅鍊깍풒饒껒뮇{^ 111010001110011010010000111010001001001010010100111001101110100010110001111011111011111010010110111010011010111010000011111011101001001010010110011111011110100011100110100100001110100010010010100101001110011011101000101100011110111110111110100101101110100110101110100000111110111010010010100101100111101101011110 e8e690e89294e6e8b1efbe96e9ae83ee92967de8e690e89294e6e8b1efbe96e9ae83ee92967b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)