To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 繞????????娃?8有??淞る?雅?? 11100011100001010011111100111111001111110011111100111111001111110011111100111111100010001010000100111111100000100101011110010111010011000011111100111111100111111100001010000010111010010011111110001001111010110011111100111111 e3853f3f3f3f3f3f3f3f88a13f8257974c3f3f9fc282e93f89eb3f3f
EUC-JP 繞?????洹??娃?8有??淞る?雅?? 111001011110010100111111001111110011111100111111001111111000111111000111101110100011111100111111101100001010001100111111101000111011100011001101101011010011111100111111110111101100010010100100111010110011111110110010111011010011111100111111 e5e53f3f3f3f3f8fc7ba3f3fb0a33fa3b8cdad3f3fdec4a4eb3fb2ed3f3f
UTF-8 繞볤퀡理롣빳洹앹뿉娃븍8有닷슖淞る듋雅뚰뒑 111001111011100110011110111010111011001110100100111011011000000010100001111011111010011110100100111010111010000110100011111010111011100110110011111001101011010010111001111011001001010110111001111010111011111110001001111001011010100010000011111010111011100010001101111011111011110010011000111001101001110010001001111010111000101110110111111011001000101010010110111001101011011110011110111000111000001010001011111010111001001110001011111010011001101110000101111010111001101010110000111010111001001010010001 e7b99eebb3a4ed80a1efa7a4eba1a3ebb9b3e6b4b9ec95b9ebbf89e5a883ebb88defbc98e69c89eb8bb7ec8a96e6b79ee3828beb938be99b85eb9ab0eb9291
UHC 繞볤퀡理롣빳洹앹뿉娃븍8有닷슖淞る듋雅뚰뒑 111010011010010010010011111010101011001110010101111011001011010110001110111001001011101110100101111010101011011110011101111011001001011110010000111010001101111110111010111010111010001110111000111010101111001110110100111001011001101010100101111000011110011110101010111010111000101010111110111001001011101010001100111011011000101010001110 e9a493eab395ecb58ee4bba5eab79dec9790e8dfbaeba3b8eaf3b4e59aa5e1e7aaeb8abee4ba8ced8a8e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)