What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??泣ワ??κ?沃??筌??泣ワ??κ?沃??B 1110001010100011001111110011111110001011100000111000001110001111001111110011111110000011110010000011111110010111100000000011111100111111111000101010001100111111001111111000101110000011100000111000111100111111001111111000001111001000001111111001011110000000001111110011111101000010 e2a33f3f8b83838f3f3f83c83f97803f3fe2a33f3f8b83838f3f3f83c83f97803f3f42
EUC-JP 筌??泣ワ?洹κ?沃??筌??泣ワ?洹κ?沃??B 111001001010010100111111001111111011010111100011101001011110111100111111100011111100011110111010101001101100101000111111110011011110000000111111001111111110010010100101001111110011111110110101111000111010010111101111001111111000111111000111101110101010011011001010001111111100110111100000001111110011111101000010 e4a53f3fb5e3a5ef3f8fc7baa6ca3fcde03f3fe4a53f3fb5e3a5ef3f8fc7baa6ca3fcde03f3f42
UTF-8 筌뚮뿦泣ワ㎗洹κ묘沃띿짗筌뚮뿦泣ワ㎗洹κ묘沃띿짗B 1110011110101101100011001110101110011010101011101110101110111111101001101110011010110011101000111110001110000011101011111110001110001110100101111110011010110100101110011100111010111010111010111010110010011000111001101011001010000011111010111001110110111111111011001010011110010111111001111010110110001100111010111001101010101110111010111011111110100110111001101011001110100011111000111000001110101111111000111000111010010111111001101011010010111001110011101011101011101011101011001001100011100110101100101000001111101011100111011011111111101100101001111001011101000010 e7ad8ceb9aaeebbfa6e6b3a3e383afe38e97e6b4b9cebaebac98e6b283eb9dbfeca797e7ad8ceb9aaeebbfa6e6b3a3e383afe38e97e6b4b9cebaebac98e6b283eb9dbfeca79742
UHC 筌뚮뿦泣ワ㎗洹κ묘沃띿짗筌뚮뿦泣ワ㎗洹κ묘沃띿짗B 11101111101001111000110011101011100101111010011011101011111010001010101111101111101001111010001111101010101101111010010111101010101110011010011011101000101010101000110111101100101000111001111011101111101001111000110011101011100101111010011011101011111010001010101111101111101001111010001111101010101101111010010111101010101110011010011011101000101010101000110111101100101000111001111001000010 efa78ceb97a6ebe8abefa7a3eab7a5eab9a6e8aa8deca39eefa78ceb97a6ebe8abefa7a3eab7a5eab9a6e8aa8deca39e42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
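
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: the function name `encode_table` is hypothetical, and it assumes that Python's codec names `iso-8859-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC rows above. Characters that a charset cannot represent are replaced with `?` (byte `3f`), similar to the ISO-8859-1 row in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


# Print one row per charset for a sample input.
for label, bits, hexa in encode_table("あB"):
    print(label, bits, hexa)
```

Because each byte is formatted as eight binary digits, the binary column is always exactly four times as long as the hexadecimal column, which is a quick way to sanity-check a row.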