To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ??????????斤??匡?????發h 001111110011111100111111001111110011111100111111001111110011111100111111001111111000101111010010001111110011111110001011101001110011111100111111001111110011111100111111111000011010001001101000 3f3f3f3f3f3f3f3f3f3f8bd23f3f8ba73f3f3f3f3fe1a268
EUC-JP ?????????珏斤??匡?????發h 0011111100111111001111110011111100111111001111110011111100111111001111111000111111001011111011011011011011010100001111110011111110110110101010010011111100111111001111110011111100111111111000101010010001101000 3f3f3f3f3f3f3f3f3f8fcbedb6d43f3fb6a93f3f3f3f3fe2a468
UTF-8 렻렒렺렫렺렊렻렖렺珏斤렖렺匡렓렻렖렺후發h 11101011101000001011101111101011101000001001001011101011101000001011101011101011101000001010101111101011101000001011101011101011101000001000101011101011101000001011101111101011101000001001011011101011101000001011101011100111100011111000111111100110100101101010010011101011101000001001011011101011101000001011101011100101100011001010000111101011101000001001001111101011101000001011101111101011101000001001011011101011101000001011101011101101100110111000010011100111100110011011110001101000 eba0bbeba092eba0baeba0abeba0baeba08aeba0bbeba096eba0bae78f8fe696a4eba096eba0bae58ca1eba093eba0bbeba096eba0baed9b84e799bc68
UHC 렻렒렺렫렺렊렻렖렺珏斤렖렺匡렓렻렖렺후發h 1000111011000011100011101010011110001110110000101000111010111001100011101100001010001110101000011000111011000011100011101010101110001110110000101100101011000100110100001100010110001110101010111000111011000010110011101100010010001110101010001000111011000011100011101010101110001110110000101100100011000100110110111010000101101000 8ec38ea78ec28eb98ec28ea18ec38eab8ec2cac4d0c58eab8ec2cec48ea88ec38eab8ec2c8c4dba168

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)