What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??唯??遺??餓λ?筌??唯??遺??餓λ?B 1110001010100011001111110011111110010111010000100011111100111111100010001110001000111111001111111000100111101100100000111100100100111111111000101010001100111111001111111001011101000010001111110011111110001000111000100011111100111111100010011110110010000011110010010011111101000010 e2a33f3f97423f3f88e23f3f89ec83c93fe2a33f3f97423f3f88e23f3f89ec83c93f42
EUC-JP 筌??唯??遺??餓λ?筌??唯??遺??餓λ?B 1110010010100101001111110011111111001101101000110011111100111111101100001110010000111111001111111011001011101110101001101100101100111111111001001010010100111111001111111100110110100011001111110011111110110000111001000011111100111111101100101110111010100110110010110011111101000010 e4a53f3fcda33f3fb0e43f3fb2eea6cb3fe4a53f3fcda33f3fb0e43f3fb2eea6cb3f42
UTF-8 筌뗫툖唯뗦젔遺용돕餓λ▷筌뗫툖唯뗦젔遺용돕餓λ▷B 1110011110101101100011001110101110010111101010111110110110001000100101101110010110010100101011111110101110010111101001101110110010100000100101001110100110000001101110101110110010011010101010011110101110001111100101011110100110100100100100111100111010111011111000101001011010110111111001111010110110001100111010111001011110101011111011011000100010010110111001011001010010101111111010111001011110100110111011001010000010010100111010011000000110111010111011001001101010101001111010111000111110010101111010011010010010010011110011101011101111100010100101101011011101000010 e7ad8ceb97abed8896e594afeb97a6eca094e981baec9aa9eb8f95e9a493cebbe296b7e7ad8ceb97abed8896e594afeb97a6eca094e981baec9aa9eb8f95e9a493cebbe296b742
UHC 筌뗫툖唯뗦젔遺용돕餓λ▷筌뗫툖唯뗦젔遺용돕餓λ▷B 11101111101001111000101111101011101110001000110111101010111001101000101111100110101000001001001011101011101101101011111111101011101101011011110111100100101110111010010111101011101000101011100111101111101001111000101111101011101110001000110111101010111001101000101111100110101000001001001011101011101101101011111111101011101101011011110111100100101110111010010111101011101000101011100101000010 efa78bebb88deae68be6a092ebb6bfebb5bde4bba5eba2b9efa78bebb88deae68be6a092ebb6bfebb5bde4bba5eba2b942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
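
The conversion shown in the table can be approximated offline. The page does not say how it is implemented, so the following is only a minimal sketch in Python, assuming that Python's built-in codecs `cp932` and `cp949` correspond to SJIS-Win and UHC. The helper name `encode_table` and the sample input are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    # Hypothetical sample input: one Japanese character plus an ASCII letter.
    for label, bits, hexstr in encode_table("あB"):
        print(label, bits, hexstr)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.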