To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???韋??夜??一??擬??夷??瑜??^ 00111111001111110011111111101000111010000011111100111111100101101110100100111111001111111000100011101010001111110011111110001011010110110011111100111111100010001100111000111111001111111110000011101111001111110011111101011110 3f3f3fe8e83f3f96e93f3f88ea3f3f8b5b3f3f88ce3f3fe0ef3f3f5e
EUC-JP ???韋??夜??一??擬??夷??瑜??^ 00111111001111110011111111110000111010100011111100111111110011001110101100111111001111111011000011101100001111110011111110110101101111000011111100111111101100001101000000111111001111111110000011110001001111110011111101011110 3f3f3ff0ea3f3fcceb3f3fb0ec3f3fb5bc3f3fb0d03f3fe0f13f3f5e
UTF-8 僚녹뼔韋귥쮫夜쏅쪇一띌뒽擬쒕걗夷덂쮿瑜곹떢^ 11101111101001101011101111101011100001011011100111101011101111001001010011101001100111111000101111101010101101111010010111101100101011101010101111100101101001001001110011101100100011111000010111101100101010101000011111100100101110001000000011101011100111011000110011101011100100101011110111100110100100111010110011101100100100101001010111101010101100011001011111100101101001001011011111101011100011011000001011101100101011101011111111100111100100011001110011101010101100111011100111101011100101101010001001011110 efa6bbeb85b9ebbc94e99f8beab7a5ecaeabe5a49cec8f85ecaa87e4b880eb9d8ceb92bde693acec9295eab197e5a4b7eb8d82ecaebfe7919ceab3b9eb96a25e
UHC 僚녹뼔韋귥쮫夜쏅쪇一띌뒽擬쒕걗夷덂쮿瑜곹떢^ 11101000111010001011001111101100100101101001110011101010110111111000001011101100101010001000100011100101101010001001101111101011101001011000000111101100111010011011011011101001100010101011001111101011111101001001110011101011100000011000001011101100101010001000100011100101101010001001101111101011101001011000000111101101100010111011011001011110 e8e8b3ec969ceadf82eca888e5a89beba581ece9b6e98ab3ebf49ceb8182eca888e5a89beba581ed8bb65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)