What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 銹???舍獅??銹?語銹???舍獅??銹??^ 1110011111110110001111110011111100111111111001000111000110001110100000100011111100111111111001111111011000111111100011001110101011100111111101100011111100111111001111111110010001110001100011101000001000111111001111111110011111110110001111110011111101011110 e7f63f3f3fe4718e823f3fe7f63f8ceae7f63f3f3fe4718e823f3fe7f63f3f5e
EUC-JP 銹???舍獅??銹?語銹???舍獅??銹?瘀^ 11101110111110000011111100111111001111111110011111010010101110111110001000111111001111111110111011111000001111111011100011101100111011101111100000111111001111110011111111100111110100101011101111100010001111110011111111101110111110000011111110001111110011011110001101011110 eef83f3f3fe7d2bbe23f3feef83fb8eceef83f3f3fe7d2bbe23f3feef83f8fcde35e
UTF-8 銹뤒뷜헤舍獅헥쨘銹롐語銹뤒뷜헤舍獅헥쨘銹롐瘀^ 11101001100010101011100111101011101001001001001011101011101101111001110011101101100101111010010011101000100010001000110111100111100011011000010111101101100101111010010111101100101010001001100011101001100010101011100111101011101000011001000011101000101010101001111011101001100010101011100111101011101001001001001011101011101101111001110011101101100101111010010011101000100010001000110111100111100011011000010111101101100101111010010111101100101010001001100011101001100010101011100111101011101000011001000011100111100110001000000001011110 e98ab9eba492ebb79ced97a4e8888de78d85ed97a5eca898e98ab9eba190e8aa9ee98ab9eba492ebb79ced97a4e8888de78d85ed97a5eca898e98ab9eba190e798805e
UHC 銹뤒뷜헤舍獅헥쨘銹롐語銹뤒뷜헤舍獅헥쨘銹롐瘀^ 111000101100100010001111110000101011101011100010110001111110110011011110111011001101111011100010110001111110110111000010101110101110001011001000100011101101011011100101110111101110001011001000100011111100001010111010111000101100011111101100110111101110110011011110111000101100011111101101110000101011101011100010110010001000111011010110111001011101110001011110 e2c88fc2bae2c7ecdeecdee2c7edc2bae2c88ed6e5dee2c88fc2bae2c7ecdeecdee2c7edc2bae2c88ed6e5dc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
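
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed mappings onto Python's built-in codecs, `encode_row` is a hypothetical helper name, and unencodable characters are replaced with `?` (0x3f), which matches the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "語^"  # one CJK character plus ASCII '^' (0x5e)
    for label, codec in CHARSETS.items():
        binary, hexa = encode_row(sample, codec)
        print(label, binary, hexa)
```

For example, `encode_row("語^", "utf-8")` yields the hex string `e8aa9e5e`, which agrees with the `e8aa9e` and trailing `5e` bytes visible in the UTF-8 row above.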