To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????G??????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100011100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f473f3f3f3f3f3f3f
SJIS-WIN ?????????懃???G??????? 00111111001111110011111100111111001111110011111100111111001111110011111110011100111001110011111100111111001111110100011100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f9ce73f3f3f473f3f3f3f3f3f3f
EUC-JP ?????????懃???G??????? 00111111001111110011111100111111001111110011111100111111001111110011111111011000111010010011111100111111001111110100011100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3fd8e93f3f3f473f3f3f3f3f3f3f
UTF-8 렺씻렖렺렔렺씻렖렻懃렊렻렠G렺씻렖렺렡렺렊 11101011101000001011101011101100100101001011101111101011101000001001011011101011101000001011101011101011101000001001010011101011101000001011101011101100100101001011101111101011101000001001011011101011101000001011101111100110100001111000001111101011101000001000101011101011101000001011101111101011101000001010000001000111111010111010000010111010111011001001010010111011111010111010000010010110111010111010000010111010111010111010000010100001111010111010000010111010111010111010000010001010 eba0baec94bbeba096eba0baeba094eba0baec94bbeba096eba0bbe68783eba08aeba0bbeba0a047eba0baec94bbeba096eba0baeba0a1eba0baeba08a
UHC 렺씻렖렺렔렺씻렖렻懃렊렻렠G렺씻렖렺렡렺렊 1000111011000010101111101100010010001110101010111000111011000010100011101010100110001110110000101011111011000100100011101010101110001110110000111101000011000100100011101010000110001110110000111000111010110001010001111000111011000010101111101100010010001110101010111000111011000010100011101011001010001110110000101000111010100001 8ec2bec48eab8ec28ea98ec2bec48eab8ec3d0c48ea18ec38eb1478ec2bec48eab8ec28eb28ec28ea1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)