To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?似?????瑙剛}v?似?????瑙剛}vB 0011111110001110100101110011111100111111001111110011111100111111111000001110110110001101100001000111110101110110001111111000111010010111001111110011111100111111001111110011111111100000111011011000110110000100011111010111011001000010 3f8e973f3f3f3f3fe0ed8d847d763f8e973f3f3f3f3fe0ed8d847d7642
EUC-JP ?似?????瑙剛}v?似?????瑙剛}vB 0011111110111011111101110011111100111111001111110011111100111111111000001110111110111001111001000111110101110110001111111011101111110111001111110011111100111111001111110011111111100000111011111011100111100100011111010111011001000010 3fbbf73f3f3f3f3fe0efb9e47d763fbbf73f3f3f3f3fe0efb9e47d7642
UTF-8 렻似렩렻씽렍렻瑙剛}v렻似렩렻씽렍렻瑙剛}vB 1110101110100000101110111110010010111100101111001110101110100000101010011110101110100000101110111110110010010100101111011110101110100000100011011110101110100000101110111110011110010001100110011110010110001001100110110111110101110110111010111010000010111011111001001011110010111100111010111010000010101001111010111010000010111011111011001001010010111101111010111010000010001101111010111010000010111011111001111001000110011001111001011000100110011011011111010111011001000010 eba0bbe4bcbceba0a9eba0bbec94bdeba08deba0bbe79199e5899b7d76eba0bbe4bcbceba0a9eba0bbec94bdeba08deba0bbe79199e5899b7d7642
UHC 렻似렩렻씽렍렻瑙剛}v렻似렩렻씽렍렻瑙剛}vB 1000111011000011110111101100010010001110101101111000111011000011101111101100010110001110101000111000111011000011110100101100010111001011101001110111110101110110100011101100001111011110110001001000111010110111100011101100001110111110110001011000111010100011100011101100001111010010110001011100101110100111011111010111011001000010 8ec3dec48eb78ec3bec58ea38ec3d2c5cba77d768ec3dec48eb78ec3bec58ea38ec3d2c5cba77d7642

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
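
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs correspond to the SJIS-WIN, EUC-JP, and UHC rows, and that characters a charset cannot represent are replaced with `?` (byte 3f), as the table's output suggests. The names `CODECS`, `encode_row`, and `encoding_table` are hypothetical.

```python
# Table labels mapped to Python codec names (assumed equivalents, see lead-in).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encoding_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encoding_table("似}vB").items():
        print(label, bits, hex_string)
```

Multi-byte charsets may differ between implementations for rarely used characters, so results for unusual input may not match the tool byte for byte.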