To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????iLh???????????iL 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101001010011000110100000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101001100 3f3f3f3f3f3f3f3f3f3f3f694c683f3f3f3f3f3f3f3f3f3f3f694c
SJIS-WIN 娼チ宵ウ。ォウ。ッiLh娼チ宵ウ。ォウ。ッiL 1000111110101001110000011000111110101010101100111010000111111000100011111010101110110011101000011111100010100001101011110110100101001100011010001000111110101001110000011000111110101010101100111010000111111000100011111010101110110011101000011111100010100001101011110110100101001100 8fa9c18faab3a1f88fabb3a1f8a1af694c688fa9c18faab3a1f88fabb3a1f8a1af694c
EUC-JP 娼チ宵ウ。?ォウ。?ッiLh娼チ宵ウ。?ォウ。?ッiL 101111101010101110001110110000011011111010101100100011101011001110001110101000010011111110001110101010111000111010110011100011101010000100111111100011101010111101101001010011000110100010111110101010111000111011000001101111101010110010001110101100111000111010100001001111111000111010101011100011101011001110001110101000010011111110001110101011110110100101001100 beab8ec1beac8eb38ea13f8eab8eb38ea13f8eaf694c68beab8ec1beac8eb38ea13f8eab8eb38ea13f8eaf694c
UTF-8 娼チ宵ウ。ォウ。ッiLh娼チ宵ウ。ォウ。ッiL 1110010110101000101111001110111110111110100000011110010110101110101101011110111110111101101100111110111110111101101000011110111010011000101011101110111110111101101010111110111110111101101100111110111110111101101000011110111010011001100000001110111110111101101011110110100101001100011010001110010110101000101111001110111110111110100000011110010110101110101101011110111110111101101100111110111110111101101000011110111010011000101011101110111110111101101010111110111110111101101100111110111110111101101000011110111010011001100000001110111110111101101011110110100101001100 e5a8bcefbe81e5aeb5efbdb3efbda1ee98aeefbdabefbdb3efbda1ee9980efbdaf694c68e5a8bcefbe81e5aeb5efbdb3efbda1ee98aeefbdabefbdb3efbda1ee9980efbdaf694c
UHC 娼?宵????????iLh娼?宵????????iL 11110011110111100011111111100001101100100011111100111111001111110011111100111111001111110011111100111111011010010100110001101000111100111101111000111111111000011011001000111111001111110011111100111111001111110011111100111111001111110110100101001100 f3de3fe1b23f3f3f3f3f3f3f3f694c68f3de3fe1b23f3f3f3f3f3f3f3f694c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)