To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲??宥??永??乙?┸???臆??瑜??^ 1110000110011111001111110011111110010111010001110011111100111111100010010110100100111111001111111000100110110011001111111000010010111101001111110011111100111111100010011011000000111111001111111110000011101111001111110011111101011110 e19f3f3f97473f3f89693f3f89b33f84bd3f3f3f89b03f3fe0ef3f3f5e
EUC-JP 癲??宥??永??乙?┸???臆??瑜??^ 1110001010100001001111110011111111001101101010000011111100111111101100011100101000111111001111111011001010110101001111111010100010111111001111110011111100111111101100101011001000111111001111111110000011110001001111110011111101011110 e2a13f3fcda83f3fb1ca3f3fb2b53fa8bf3f3f3fb2b23f3fe0f13f3f5e
UTF-8 癲용틺宥껅늿永띕쪉乙싷┸類좊땭臆덄독瑜껋굸^ 11100111100110011011001011101100100110101010100111101101100010111011101011100101101011101010010111101010101110111000010111101011100010101011111111100110101100001011100011101011100111011001010111101100101010101000100111100100101110011001100111101100100010111011011111100010100101001011100011101111101001111001000011101100101000101000101011101011100101011010110111101000100001111000011011101011100011011000010011101011100011111000010111100111100100011001110011101010101110111000101111101010101101011011100001011110 e799b2ec9aa9ed8bbae5aea5eabb85eb8abfe6b0b8eb9d95ecaa89e4b999ec8bb7e294b8efa790eca28aeb95ade88786eb8d84eb8f85e7919ceabb8beab5b85e
UHC 癲용틺宥껅늿永띕쪉乙싷┸類좊땭臆덄독瑜껋굸^ 11101111101001101011111111101011101110101010000011101010111010011000001111100110100010001000100011100111101101011011011011101011101001011000001111101011111000001001101011101111101001101011111111101011101110101010000011101011100010111000001111100101111001101000100011100111101101011011011011101011101001011000001111101100100000101001011101011110 efa6bfebbaa0eae983e68888e7b5b6eba583ebe09aefa6bfebbaa0eb8b83e5e688e7b5b6eba583ec82975e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)