To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鞨ィ螳茨スイ邇匁キケ驍ィ螳茨スイ跪匁キケB 11101000111000001010100011100101101011101000100011101111101111011011001011100111100011101001011011100110101101111011100111101001100000101010100011100101101011101000100011101111101111011011001011100110111011001001011011100110101101111011100101000010 e8e0a8e5ae88efbdb2e78e96e6b7b9e982a8e5ae88efbdb2e6ec96e6b7b942
EUC-JP 鞨ィ螳茨スイ邇匁キケ驍ィ螳茨スイ跪匁キケB 1111000011100010100011101010100011101010101100001011000011110001100011101011110110001110101100101110110111101110110011001110100010001110101101111000111010111001111100011110001010001110101010001110101010110000101100001111000110001110101111011000111010110010111011001110111011001100111010001000111010110111100011101011100101000010 f0e28ea8eab0b0f18ebd8eb2edeecce88eb78eb9f1e28ea8eab0b0f18ebd8eb2eceecce88eb78eb942
UTF-8 鞨ィ螳茨スイ邇匁キケ驍ィ螳茨スイ跪匁キケB 11101001100111101010100011101111101111011010100011101000100111101011001111101000100011001010100011101111101111011011110111101111101111011011001011101001100000101000011111100101100011001000000111101111101111011011011111101111101111011011100111101001101010011000110111101111101111011010100011101000100111101011001111101000100011001010100011101111101111011011110111101111101111011011001011101000101101111010101011100101100011001000000111101111101111011011011111101111101111011011100101000010 e99ea8efbda8e89eb3e88ca8efbdbdefbdb2e98287e58c81efbdb7efbdb9e9a98defbda8e89eb3e88ca8efbdbdefbdb2e8b7aae58c81efbdb7efbdb942
UHC 鞨?螳茨??邇???驍?螳茨??????B 11001010111010100011111111010011110110011110110110111100001111110011111111101100110001000011111100111111001111111111110110100100001111111101001111011001111011011011110000111111001111110011111100111111001111110011111101000010 caea3fd3d9edbc3f3fecc43f3f3ffda43fd3d9edbc3f3f3f3f3f3f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
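
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name to_bit_strings and the mapping of the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1 row above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in to_bit_strings("あB").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.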