To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???邯???洵??怨??巍モ?竊??揄??^ 001111110011111100111111111001111011011000111111001111110011111110011111101010110011111100111111100010011000010100111111001111111001101111011001100000111000001000111111111000101000011000111111001111111001110110001001001111110011111101011110 3f3f3fe7b63f3f3f9fab3f3f89853f3f9bd983823fe2863f3f9d893f3f5e
EUC-JP ???邯???洵??怨??巍モ?竊??揄??^ 001111110011111100111111111011101011100000111111001111110011111111011110101011010011111100111111101100011110010100111111001111111101011011011011101001011110001000111111111000111110011000111111001111111101100111101001001111110011111101011110 3f3f3feeb83f3f3fdead3f3fb1e53f3fd6dba5e23fe3e63f3fd9e93f3f5e
UTF-8 列룸씈邯列룸씈洵ㅵ푻怨뺧폋巍モ뫖竊뽩ㅇ揄먭쿃^ 11101111101001101001110011101011101000111011100011101100100101001000100011101001100000101010111111101111101001101001110011101011101000111011100011101100100101001000100011100110101101001011010111100011100001011011010111101101100100011011101111100110100000001010100011101011101110101010011111101101100011111000101111100101101101111000110111100011100000111010001011101011101010111001011011100111101010111000101011101011101111011010100111100011100001011000011111100110100011111000010011101011101010001010110111101100101111111000001101011110 efa69ceba3b8ec9488e982afefa69ceba3b8ec9488e6b4b5e385b5ed91bbe680a8ebbaa7ed8f8be5b78de383a2ebab96e7ab8aebbda9e38587e68f84eba8adecbf835e
UHC 列룸씈邯列룸씈洵ㅵ푻怨뺧폋巍モ뫖竊뽩ㅇ揄먭쿃^ 111001101110101010110111111010111001110110100000110010101111101111100110111010101011011111101011100111011010000011100010111001111010010011100101101111101000011111101010101100111001010111101111101111001001011011101000111001001010101111100010100100011011100011101111101111001001011011100101101001001011011111101010111100011001000011101010101100101001100101011110 e6eab7eb9da0cafbe6eab7eb9da0e2e7a4e5be87eab395efbc96e8e4abe291b8efbc96e5a4b7eaf190eab2995e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)