To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?∥韋???с?艶N??癲?8????異 111001011111000100111111100000010110000111101000111010000011111100111111001111111000010010000011001111111000100110010000100000100110110100111111001111111110000110011111001111111000001001010111001111110011111100111111001111111000100011011001 e5f13f8161e8e83f3f3f84833f8990826d3f3fe19f3f82573f3f3f3f88d9
EUC-JP 褥?‖韋???с?艶N??癲?8????異 111010101111001100111111101000011100001011110000111010100011111100111111001111111010011111100011001111111011000111110000101000111100111000111111001111111110001010100001001111111010001110111000001111110011111100111111001111111011000011011011 eaf33fa1c2f0ea3f3f3fa7e33fb1f0a3ce3f3fe2a13fa3b83f3f3f3fb0db
UTF-8 褥띕∥韋뤻씣戮с걶艶N쎈떑癲쒕8梨뜹젆짰異 1110100010100100101001011110101110011101100101011110001010001000101001011110100110011111100010111110101110100100101110111110110010010100101000111110111110100111100100101101000110000001111010101011000110110110111010001000100110110110111011111011110010101110111011001000111010001000111010111001011010010001111001111001100110110010111011001001001010010101111011111011110010011000111011111010011110100010111010111001110010111001111011001010000010000110111011001010011110110000111001111001010110110000 e8a4a5eb9d95e288a5e99f8beba4bbec94a3efa792d181eab1b6e889b6efbcaeec8e88eb9691e799b2ec9295efbc98efa7a2eb9cb9eca086eca7b0e795b0
UHC 褥띕∥韋뤻씣戮с걶艶N쎈떑癲쒕8梨뜹젆짰異 111010011011001110110110111010111010000110101011111010101101111110001111111010011001110110110111111010111011110110101100111000111000000110011100111001101111110110100011110011101011110111101011100010111010011111101111101001101001110011101011101000111011100011101100101100011011011011100101101000001000100111000010101011101110110010110110 e9b3b6eba1abeadf8fe99db7ebbdace3819ce6fda3cebdeb8ba7efa69ceba3b8ecb1b6e5a089c2aeecb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)