To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??泣ワ??κ?語⑥??②?淫??筌 11100010101000110011111100111111100010111000001110000011100011110011111100111111100000111100100000111111100011001110101010000111010001010011111100111111100001110100000100111111100010001111101000111111001111111110001010100011 e2a33f3f8b83838f3f3f83c83f8cea87453f3f87413f88fa3f3fe2a3
EUC-JP 筌??泣ワ?洹κ?語??嫄??淫??筌 111001001010010100111111001111111011010111100011101001011110111100111111100011111100011110111010101001101100101000111111101110001110110000111111001111111000111110111010101000010011111100111111101100001111110000111111001111111110010010100101 e4a53f3fb5e3a5ef3f8fc7baa6ca3fb8ec3f3f8fbaa13f3fb0fc3f3fe4a5
UTF-8 筌뚮뿦泣ワ㎗洹κ묘語⑥뜫嫄②퀍淫낃괬筌 1110011110101101100011001110101110011010101011101110101110111111101001101110011010110011101000111110001110000011101011111110001110001110100101111110011010110100101110011100111010111010111010111010110010011000111010001010101010011110111000101001000110100101111010111001110010101011111001011010101110000100111000101001000110100001111011011000000010001101111001101011011110101011111010111000001010000011111010101011010010101100111001111010110110001100 e7ad8ceb9aaeebbfa6e6b3a3e383afe38e97e6b4b9cebaebac98e8aa9ee291a5eb9cabe5ab84e291a1ed808de6b7abeb8283eab4ace7ad8c
UHC 筌뚮뿦泣ワ㎗洹κ묘語⑥뜫嫄②퀍淫낃괬筌 1110111110100111100011001110101110010111101001101110101111101000101010111110111110100111101000111110101010110111101001011110101010111001101001101110010111011110101010001110110010001101101011001110101010110001101010001110100010110011100000111110101111100010100001011110101010110001101010011110111110100111 efa78ceb97a6ebe8abefa7a3eab7a5eab9a6e5dea8ec8daceab1a8e8b383ebe285eab1a9efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)