To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????v???????????vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???????????v???????????vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f7642
EUC-JP ???????????v???????????vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f7642
UTF-8 혧횉첸창챵짙챌횞횧횉짯v혧횉첸창챵짙챌횞횧횉짯vB 111011011001100010100111111011011001101010001001111011001011001010111000111011001011000010111101111011001011000110110101111011001010011110011001111011001011000110001100111011011001101010011110111011011001101010100111111011011001101010001001111011001010011110101111011101101110110110011000101001111110110110011010100010011110110010110010101110001110110010110000101111011110110010110001101101011110110010100111100110011110110010110001100011001110110110011010100111101110110110011010101001111110110110011010100010011110110010100111101011110111011001000010 ed98a7ed9a89ecb2b8ecb0bdecb1b5eca799ecb18ced9a9eed9aa7ed9a89eca7af76ed98a7ed9a89ecb2b8ecb0bdecb1b5eca799ecb18ced9a9eed9aa7ed9a89eca7af7642
UHC 혧횉첸창챵짙챌횞횧횉짯v혧횉첸창챵짙챌횞횧횉짯vB 1100001010001111110000111000011111000011101111101100001110100010110000111011001011000010101000111100001110100111110000111001011111000011100111101100001110000111110000101010110101110110110000101000111111000011100001111100001110111110110000111010001011000011101100101100001010100011110000111010011111000011100101111100001110011110110000111000011111000010101011010111011001000010 c28fc387c3bec3a2c3b2c2a3c3a7c397c39ec387c2ad76c28fc387c3bec3a2c3b2c2a3c3a7c397c39ec387c2ad7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)