To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????F?????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000110001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f463f3f3f3f3f3f
SJIS-WIN 永??泣?永?????違??F永????Ⅹ 100010010110100100111111001111111000101110000011001111111000100101101001001111110011111100111111001111110011111110001000111000010011111100111111010001101000100101101001001111110011111100111111001111111000011101011101 89693f3f8b833f89693f3f3f3f3f88e13f3f4689693f3f3f3f875d
EUC-JP 永??泣?永??沅??違??F永??沅?? 101100011100101000111111001111111011010111100011001111111011000111001010001111110011111110001111110001101110100100111111001111111011000011100011001111110011111101000110101100011100101000111111001111111000111111000110111010010011111100111111 b1ca3f3fb5e33fb1ca3f3f8fc6e93f3fb0e33f3f46b1ca3f3f8fc6e93f3f
UTF-8 永띔퍜泣쩎永띔퍌沅쀧쨼違먯뒫F永띔퍌沅쀯Ⅹ 11100110101100001011100011101011100111011001010011101101100011011001110011100110101100111010001111101100101010011000111011100110101100001011100011101011100111011001010011101101100011011000110011100110101100101000010111101100100000001010011111101100101010001011110011101001100000011001010111101011101010001010111111101011100100101010101101000110111001101011000010111000111010111001110110010100111011011000110110001100111001101011001010000101111011001000000010101111111000101000010110101001 e6b0b8eb9d94ed8d9ce6b3a3eca98ee6b0b8eb9d94ed8d8ce6b285ec80a7eca8bce98195eba8afeb92ab46e6b0b8eb9d94ed8d8ce6b285ec80afe285a9
UHC 永띔퍜泣쩎永띔퍌沅쀧쨼違먯뒫F永띔퍌沅쀯Ⅹ 1110011110110101101101101110101010111011100100111110101111101000101001010100011011100111101101011011011011101010101110111000001111101010101101101001011111100111101001001001011011101010110111101001000011101100100010101010010101000110111001111011010110110110111010101011101110000011111010101011011010010111111011111010010110111001 e7b5b6eabb93ebe8a546e7b5b6eabb83eab697e7a496eade90ec8aa546e7b5b6eabb83eab697efa5b9

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
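
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names are assumptions about how the table's charset labels map to Python's standard codecs (in particular, SJIS-WIN is taken to be cp932 and UHC to be cp949), and the helper names encode_row and encode_all are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_all(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in encode_all("永F").items():
        print(name, bits, hexstr)
```

For example, under these assumptions the character 永 encodes to e6b0b8 in UTF-8, b1ca in EUC-JP, and 8969 in CP932, while ISO-8859-1 cannot represent it and yields 3f. These are the same byte values that appear for 永 in the table above.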