To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??揖х??⑥?永??泣①?猿??永??魏ワⅨ 100010010110100100111111001111111001011101001011100001001000011100111111001111111000011101000101001111111000100101101001001111110011111110001011100000111000011101000000001111111000100110001110001111110011111110001001011010010011111100111111111010011011000010000011100011111000011101011100 89693f3f974b84873f3f87453f89693f3f8b8387403f898e3f3f89693f3fe9b0838f875c
EUC-JP 永??揖х?洹??永??泣??猿??永??魏ワ? 1011000111001010001111110011111111001101101011001010011111100111001111111000111111000111101110100011111100111111101100011100101000111111001111111011010111100011001111110011111110110001111011100011111100111111101100011100101000111111001111111111001010110010101001011110111100111111 b1ca3f3fcdaca7e73f8fc7ba3f3fb1ca3f3fb5e33f3fb1ee3f3fb1ca3f3ff2b2a5ef3f
UTF-8 永띔퍜揖х독洹⑥뎾永띔랬泣①독猿볦뎽永띔래魏ワⅨ 1110011010110000101110001110101110011101100101001110110110001101100111001110011010001111100101101101000110000101111010111000111110000101111001101011010010111001111000101001000110100101111010111000111010111110111001101011000010111000111010111001110110010100111010111001111010101100111001101011001110100011111000101001000110100000111010111000111110000101111001111000110010111111111010111011001110100110111010111000111010111101111001101011000010111000111010111001110110010100111010111001111010011000111010011010110110001111111000111000001110101111111000101000010110101000 e6b0b8eb9d94ed8d9ce68f96d185eb8f85e6b4b9e291a5eb8ebee6b0b8eb9d94eb9eace6b3a3e291a0eb8f85e78cbfebb3a6eb8ebde6b0b8eb9d94eb9e98e9ad8fe383afe285a8
UHC 永띔퍜揖х독洹⑥뎾永띔랬泣①독猿볦뎽永띔래魏ワⅨ 111001111011010110110110111010101011101110010011111010111110011110101100111001111011010110110110111010101011011110101000111011001000100110010001111001111011010110110110111010101011011110101000111010111110100010101000111001111011010110110110111010101011101110010011111011001000100110010000111001111011010110110110111010101011011110100001111010101110000010101011111011111010010110111000 e7b5b6eabb93ebe7ace7b5b6eab7a8ec8991e7b5b6eab7a8ebe8a8e7b5b6eabb93ec8990e7b5b6eab7a1eae0abefa5b8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)