What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 宵リ娼テ宵モ宵苡ォ竢ォモ将苡ェ」宵ウ将 1000111110101010110110001000111110101001110000111000111110101010110100111000111110101010111001001000111110101011111000101000111110101011110100111000111110101011111001001000111110101010101000111000111110101010101100111000111110101011 8faad88fa9c38faad38faae48fabe28fabd38fabe48faaa38faab38fab
EUC-JP 宵リ娼テ宵モ宵苡ォ竢ォモ将苡ェ」宵ウ将 1011111010101100100011101101100010111110101010111000111011000011101111101010110010001110110100111011111010101100111001111110111110001110101010111110001111101111100011101010101110001110110100111011111010101101111001111110111110001110101010101000111010100011101111101010110010001110101100111011111010101101 beac8ed8beab8ec3beac8ed3beace7ef8eabe3ef8eab8ed3beade7ef8eaa8ea3beac8eb3bead
UTF-8 宵リ娼テ宵モ宵苡ォ竢ォモ将苡ェ」宵ウ将 111001011010111010110101111011111011111010011000111001011010100010111100111011111011111010000011111001011010111010110101111011111011111010010011111001011010111010110101111010001000101110100001111011111011110110101011111001111010101110100010111011111011110110101011111011111011111010010011111001011011000010000110111010001000101110100001111011111011110110101010111011111011110110100011111001011010111010110101111011111011110110110011111001011011000010000110 e5aeb5efbe98e5a8bcefbe83e5aeb5efbe93e5aeb5e88ba1efbdabe7aba2efbdabefbe93e5b086e88ba1efbdaaefbda3e5aeb5efbdb3e5b086
UHC 宵?娼?宵?宵苡?????苡??宵?? 1110000110110010001111111111001111011110001111111110000110110010001111111110000110110010111011001011111000111111001111110011111100111111001111111110110010111110001111110011111111100001101100100011111100111111 e1b23ff3de3fe1b23fe1b2ecbe3f3f3f3f3fecbe3f3fe1b23f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
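
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The helper name `to_bit_strings` is hypothetical, and the Python codec names are assumed equivalents of the charsets listed above (`cp932` for SJIS-Win, `cp949` for UHC). It also assumes that characters a charset cannot represent are replaced with "?" (hex 3f), as the ISO-8859-1 row suggests.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hex) strings.

    Characters the charset cannot represent are replaced with "?" (0x3f).
    This replacement policy is an assumption about the converter.
    """
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed Python codec names for the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Results may differ from the table for some characters, because each library's mapping tables for these legacy charsets vary slightly.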