To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????TB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
SJIS-WIN 永??揖?????癲??揖х??⑥?永??鍮?TB 100010010110100100111111001111111001011101001011001111110011111100111111001111110011111111100001100111110011111100111111100101110100101110000100100001110011111100111111100001110100010100111111100010010110100100111111001111111110100001001010001111110101010001000010 89693f3f974b3f3f3f3f3fe19f3f3f974b84873f3f87453f89693f3fe84a3f5442
EUC-JP 永??揖??洹??癲??揖х?洹??永??鍮?TB 101100011100101000111111001111111100110110101100001111110011111110001111110001111011101000111111001111111110001010100001001111110011111111001101101011001010011111100111001111111000111111000111101110100011111100111111101100011100101000111111001111111110111110101011001111110101010001000010 b1ca3f3fcdac3f3f8fc7ba3f3fe2a13f3fcdaca7e73f8fc7ba3f3fb1ca3f3fefab3f5442
UTF-8 永띔퍜揖덄독洹욎틯癲㏃슱揖х독洹⑥돱永띔퇊鍮펔TB 11100110101100001011100011101011100111011001010011101101100011011001110011100110100011111001011011101011100011011000010011101011100011111000010111100110101101001011100111101100100110101000111011101101100010111010111111100111100110011011001011100011100011111000001111101100100010101011000111100110100011111001011011010001100001011110101110001111100001011110011010110100101110011110001010010001101001011110101110001111101100011110011010110000101110001110101110011101100101001110110110000111100010101110100110001101101011101110110110001110100101000101010001000010 e6b0b8eb9d94ed8d9ce68f96eb8d84eb8f85e6b4b9ec9a8eed8bafe799b2e38f83ec8ab1e68f96d185eb8f85e6b4b9e291a5eb8fb1e6b0b8eb9d94ed878ae98daeed8e945442
UHC 永띔퍜揖덄독洹욎틯癲㏃슱揖х독洹⑥돱永띔퇊鍮펔TB 111001111011010110110110111010101011101110010011111010111110011110001000111001111011010110110110111010101011011110011110111011001011101010011001111011111010011010100111111011001001101010111000111010111110011110101100111001111011010110110110111010101011011110101000111011001000100110110100111001111011010110110110111010101011011110011011111010111011100110111100011010000101010001000010 e7b5b6eabb93ebe788e7b5b6eab79eecba99efa6a7ec9ab8ebe7ace7b5b6eab7a8ec89b4e7b5b6eab79bebb9bc685442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)