To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘖?????擬?ザ???陰??????レⅢ 100111110101000000111111001111110011111100111111001111111000101101011011001111111000001101010101001111110011111100111111100010010100000100111111001111110011111100111111001111110011111110000011100011001000011101010110 9f503f3f3f3f3f8b5b3f83553f3f3f89413f3f3f3f3f3f838c8756
EUC-JP 蘖?????擬?ザ饔??陰??洧???レ? 110111011011000100111111001111110011111100111111001111111011010110111100001111111010010110110110100011111110100011101111001111110011111110110001101000100011111100111111100011111100011110110100001111110011111100111111101001011110110000111111 ddb13f3f3f3f3fb5bc3fa5b68fe8ef3f3fb1a23f3f8fc7b43f3f3fa5ec3f
UTF-8 蘖뽰궡劉녜걬擬듭ザ饔낃껴陰㎪뒔洧뺣뼢曆レⅢ 111010001001100010010110111010111011110110110000111010101011011010100001111011111010011110000111111010111000010110011100111010101011000110101100111001101001001110101100111010111001001110101101111000111000001010110110111010011010010110010100111010111000001010000011111010101011101110110100111010011001100110110000111000111000111010101010111010111001001010010100111001101011010010100111111010111011101010100011111010111011110010100010111011111010011010001011111000111000001110101100111000101000010110100010 e89896ebbdb0eab6a1efa787eb859ceab1ace693aceb93ade382b6e9a594eb8283eabbb4e999b0e38eaaeb9294e6b4a7ebbaa3ebbca2efa68be383ace285a2
UHC 蘖뽰궡劉녜걬擬듭ザ饔낃껴陰㎪뒔洧뺣뼢曆レⅢ 111001011110111010010110111011001000001010110100111010101110010110110011111010011000000110010101111010111111010010110101111011001010101110110110111010001011110110000101111010101011001010111000111010111110010010100111111001101000101010010001111010101111101110010101111010111001011010100101111001101011011110101011111011001010010110110010 e5ee96ec82b4eae5b3e98195ebf4b5ecabb6e8bd85eab2b8ebe4a7e68a91eafb95eb96a5e6b7abeca5b2

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
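
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the function names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_table("擬").items():
        print(label, bits, hex_digits)
```

Calling encode_table with a single character gives one row per charset, so the same character can be compared across encodings side by side.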