To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??泣ワ??κ?筌???⑨?蹂μ?癲 11100010101000110011111100111111100010111000001110000011100011110011111100111111100000111100100000111111111000101010001100111111001111110011111110000111010010000011111111100110111110001000001111001010001111111110000110011111 e2a33f3f8b83838f3f3f83c83fe2a33f3f3f87483fe6f883ca3fe19f
EUC-JP 筌??泣ワ?洹κ?筌??嫄??蹂μ?癲 11100100101001010011111100111111101101011110001110100101111011110011111110001111110001111011101010100110110010100011111111100100101001010011111100111111100011111011101010100001001111110011111111101100111110101010011011001100001111111110001010100001 e4a53f3fb5e3a5ef3f8fc7baa6ca3fe4a53f3f8fbaa13f3fecfaa6cc3fe2a1
UTF-8 筌뚮뿦泣ワ㎗洹κ묘筌딅쵄嫄⑨쭓蹂μ㉠癲 11100111101011011000110011101011100110101010111011101011101111111010011011100110101100111010001111100011100000111010111111100011100011101001011111100110101101001011100111001110101110101110101110101100100110001110011110101101100011001110101110010100100001011110110010110101100001001110010110101011100001001110001010010001101010001110110010101101100100111110100010111001100000101100111010111100111000111000100110100000111001111001100110110010 e7ad8ceb9aaeebbfa6e6b3a3e383afe38e97e6b4b9cebaebac98e7ad8ceb9485ecb584e5ab84e291a8ecad93e8b982cebce389a0e799b2
UHC 筌뚮뿦泣ワ㎗洹κ묘筌딅쵄嫄⑨쭓蹂μ㉠癲 1110111110100111100011001110101110010111101001101110101111101000101010111110111110100111101000111110101010110111101001011110101010111001101001101110111110100111100010101110101110101100100001101110101010110001101010001110111110100111100010111110101110110011101001011110110010101000101100011110111110100110 efa78ceb97a6ebe8abefa7a3eab7a5eab9a6efa78aebac86eab1a8efa78bebb3a5eca8b1efa6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)