To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 螂ェ蛛エ蟆願ェー鞜懈純遶ェ謐我辯隱ー鞜懈純B 11100101101001011010101011100101100000011011010011100101101100001000101011101000101010101011000011101000110111111001110011100110100011111000001111100111101010111010101011100110100011011000100111100100111001111000011111101000101010101011000011101000110111111001110011100110100011111000001101000010 e5a5aae581b4e5b08ae8aab0e8df9ce68f83e7abaae68d89e4e787e8aab0e8df9ce68f8342
EUC-JP 螂ェ蛛エ蟆願ェー鞜懈純遶ェ謐我辯隱ー鞜懈純B 11101010101001111000111010101010111010011110000110001110101101001110101010110010101101001110101010001110101010101000111010110000111100001110000111011000111010001011110111100011111011101010110110001110101010101110101111101101101100101110011011101101111001111111000010101100100011101011000011110000111000011101100011101000101111011110001101000010 eaa78eaae9e18eb4eab2b4ea8eaa8eb0f0e1d8e8bde3eead8eaaebedb2e6ede7f0ac8eb0f0e1d8e8bde342
UTF-8 螂ェ蛛エ蟆願ェー鞜懈純遶ェ謐我辯隱ー鞜懈純B 11101000100111101000001011101111101111011010101011101000100110111001101111101111101111011011010011101000100111111000011011101001101000011001100011101111101111011010101011101111101111011011000011101001100111101001110011100110100001111000100011100111101101001001010011101001100000011011011011101111101111011010101011101000101011001001000011100110100010001001000111101000101111101010111111101001100110101011000111101111101111011011000011101001100111101001110011100110100001111000100011100111101101001001010001000010 e89e82efbdaae89b9befbdb4e89f86e9a198efbdaaefbdb0e99e9ce68788e7b494e981b6efbdaae8ac90e68891e8beafe99ab1efbdb0e99e9ce68788e7b49442
UHC 螂?蛛??願???懈純??謐我辯隱??懈純B 110101011100110000111111111100011100100000111111001111111110101011000011001111110011111100111111111110101010101111100010111011010011111100111111110110101100110111100100101100101101110010101010111010111101111100111111001111111111101010101011111000101110110101000010 d5cc3ff1c83f3feac33f3f3ffaabe2ed3f3fdacde4b2dcaaebdf3f3ffaabe2ed42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)