To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弔?吟?衣絲??畯貊?弔?吟?衣絲??畯貊?^ 1001001010100010001111111000101111100001001111111000100011011111111000110100111000111111001111111111101101101111111001101011101100111111100100101010001000111111100010111110000100111111100010001101111111100011010011100011111100111111111110110110111111100110101110110011111101011110 92a23f8be13f88dfe34e3f3ffb6fe6bb3f92a23f8be13f88dfe34e3f3ffb6fe6bb3f5e
EUC-JP 弔?吟?衣絲??畯貊?弔?吟?衣絲??畯貊?^ 11000100101001000011111110110110111000110011111110110000111000011110010110101111001111110011111110001111110011011011101111101100101111010011111111000100101001000011111110110110111000110011111110110000111000011110010110101111001111110011111110001111110011011011101111101100101111010011111101011110 c4a43fb6e33fb0e1e5af3f3f8fcdbbecbd3fc4a43fb6e33fb0e1e5af3f3f8fcdbbecbd3f5e
UTF-8 弔렲吟렞衣絲렕렟畯貊긺弔렲吟렞衣絲렕렟畯貊긺^ 11100101101111001001010011101011101000001011001011100101100100001001111111101011101000001001111011101000101000011010001111100111101101011011001011101011101000001001010111101011101000001001111111100111100101011010111111101000101100101000101011101010101110001011101011100101101111001001010011101011101000001011001011100101100100001001111111101011101000001001111011101000101000011010001111100111101101011011001011101011101000001001010111101011101000001001111111100111100101011010111111101000101100101000101011101010101110001011101001011110 e5bc94eba0b2e5909feba09ee8a1a3e7b5b2eba095eba09fe795afe8b28aeab8bae5bc94eba0b2e5909feba09ee8a1a3e7b5b2eba095eba09fe795afe8b28aeab8ba5e
UHC 弔렲吟렞衣絲렕렟畯貊긺弔렲吟렞衣絲렕렟畯貊긺^ 111100001100000010001110101111111110101111100001100011101010111111101011111111011101111011101010100011101010101010001110101100001111000111100001110110001110011110110001111001111111000011000000100011101011111111101011111000011000111010101111111010111111110111011110111010101000111010101010100011101011000011110001111000011101100011100111101100011110011101011110 f0c08ebfebe18eafebfddeea8eaa8eb0f1e1d8e7b1e7f0c08ebfebe18eafebfddeea8eaa8eb0f1e1d8e7b1e75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)