To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲??唯?????沃??癲??唯?????沃??B 11100001100111110011111100111111100101110100001000111111001111110011111100111111001111111001011110000000001111110011111111100001100111110011111100111111100101110100001000111111001111110011111100111111001111111001011110000000001111110011111101000010 e19f3f3f97423f3f3f3f3f97803f3fe19f3f3f97423f3f3f3f3f97803f3f42
EUC-JP 癲??唯?????沃??癲??唯?????沃??B 11100010101000010011111100111111110011011010001100111111001111110011111100111111001111111100110111100000001111110011111111100010101000010011111100111111110011011010001100111111001111110011111100111111001111111100110111100000001111110011111101000010 e2a13f3fcda33f3f3f3f3fcde03f3fe2a13f3fcda33f3f3f3f3fcde03f3f42
UTF-8 癲용뿨唯녽굲紐뚰뭽沃욎챳癲용뿨唯녽굲紐뚰뭽沃욎챳B 11100111100110011011001011101100100110101010100111101011101111111010100011100101100101001010111111101011100001011011110111101010101101011011001011101111101001111000111111101011100110101011000011101011101011011011110111100110101100101000001111101100100110101000111011101100101100011011001111100111100110011011001011101100100110101010100111101011101111111010100011100101100101001010111111101011100001011011110111101010101101011011001011101111101001111000111111101011100110101011000011101011101011011011110111100110101100101000001111101100100110101000111011101100101100011011001101000010 e799b2ec9aa9ebbfa8e594afeb85bdeab5b2efa78feb9ab0ebadbde6b283ec9a8eecb1b3e799b2ec9aa9ebbfa8e594afeb85bdeab5b2efa78feb9ab0ebadbde6b283ec9a8eecb1b342
UHC 癲용뿨唯녽굲紐뚰뭽沃욎챳癲용뿨唯녽굲紐뚰뭽沃욎챳B 11101111101001101011111111101011100101111010100011101010111001101000011011101001100000101001010111101011101010101000110011101101100100101000110011101000101010101001111011101100101010101000000111101111101001101011111111101011100101111010100011101010111001101000011011101001100000101001010111101011101010101000110011101101100100101000110011101000101010101001111011101100101010101000000101000010 efa6bfeb97a8eae686e98295ebaa8ced928ce8aa9eecaa81efa6bfeb97a8eae686e98295ebaa8ced928ce8aa9eecaa8142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
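
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not the tool's actual implementation. The codec names are assumptions about which Python codecs correspond to the charsets listed (cp932 for SJIS-WIN, cp949 for UHC), and the function names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("B").items():
        print(label, binary, hexadecimal)
```

For the single letter "B", every charset listed yields the same byte, 42 in hexadecimal (01000010 in binary), which is why each row of the table ends in 42.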