To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????f?????^}Y?????f?????^}bE 0011111100111111001111110011111100111111011001100011111100111111001111110011111100111111010111100111110101011001001111110011111100111111001111110011111101100110001111110011111100111111001111110011111101011110011111010110001001000101 3f3f3f3f3f663f3f3f3f3f5e7d593f3f3f3f3f663f3f3f3f3f5e7d6245
SJIS-WIN 絶??嚥?f絶??嚥?^}Y絶??嚥?f絶??嚥?^}bE 10010000111000100011111100111111100110101000101100111111011001101001000011100010001111110011111110011010100010110011111101011110011111010101100110010000111000100011111100111111100110101000101100111111011001101001000011100010001111110011111110011010100010110011111101011110011111010110001001000101 90e23f3f9a8b3f6690e23f3f9a8b3f5e7d5990e23f3f9a8b3f6690e23f3f9a8b3f5e7d6245
EUC-JP 絶??嚥?f絶??嚥?^}Y絶??嚥?f絶??嚥?^}bE 11000000111001000011111100111111110100111110101100111111011001101100000011100100001111110011111111010011111010110011111101011110011111010101100111000000111001000011111100111111110100111110101100111111011001101100000011100100001111110011111111010011111010110011111101011110011111010110001001000101 c0e43f3fd3eb3f66c0e43f3fd3eb3f5e7d59c0e43f3fd3eb3f66c0e43f3fd3eb3f5e7d6245
UTF-8 絶뗩씍嚥쁭f絶뗩씍嚥쁭^}Y絶뗩씍嚥쁭f絶뗩씍嚥쁭^}bE 111001111011010110110110111010111001011110101001111011001001010010001101111001011001101010100101111011001000000110101101011001101110011110110101101101101110101110010111101010011110110010010100100011011110010110011010101001011110110010000001101011010101111001111101010110011110011110110101101101101110101110010111101010011110110010010100100011011110010110011010101001011110110010000001101011010110011011100111101101011011011011101011100101111010100111101100100101001000110111100101100110101010010111101100100000011010110101011110011111010110001001000101 e7b5b6eb97a9ec948de59aa5ec81ad66e7b5b6eb97a9ec948de59aa5ec81ad5e7d59e7b5b6eb97a9ec948de59aa5ec81ad66e7b5b6eb97a9ec948de59aa5ec81ad5e7d6245
UHC 絶뗩씍嚥쁭f絶뗩씍嚥쁭^}Y絶뗩씍嚥쁭f絶뗩씍嚥쁭^}bE 11101111101111101000101111101001100111011010010011100110101111111001100001101110011001101110111110111110100010111110100110011101101001001110011010111111100110000110111001011110011111010101100111101111101111101000101111101001100111011010010011100110101111111001100001101110011001101110111110111110100010111110100110011101101001001110011010111111100110000110111001011110011111010110001001000101 efbe8be99da4e6bf986e66efbe8be99da4e6bf986e5e7d59efbe8be99da4e6bf986e66efbe8be99da4e6bf986e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)