Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 魚??韋?????????k?????m?B 1000101110011011001111110011111111101000111010000011111100111111001111110011111100111111001111110011111100111111001111111000001010001011001111110011111100111111001111110011111110000010100011010011111101000010 8b9b3f3fe8e83f3f3f3f3f3f3f3f3f828b3f3f3f3f3f828d3f42
EUC-JP 魚??韋?????靷??洹k????洹m?B 1011010111111011001111110011111111110000111010100011111100111111001111110011111100111111100011111110011110111101001111110011111110001111110001111011101010100011111010110011111100111111001111110011111110001111110001111011101010100011111011010011111101000010 b5fb3f3ff0ea3f3f3f3f3f8fe7bd3f3f8fc7baa3eb3f3f3f3f8fc7baa3ed3f42
UTF-8 魚잕랜韋껋㏏捻곌랜靷쀦궆洹k걙吏밭솻洹m닓B 11101001101011011001101011101100100111101001010111101011100111101001110011101001100111111000101111101010101110111000101111100011100011111000111111101111101001101010010011101010101100111000110011101011100111101001110011101001100111011011011111101100100000001010011011101010101101101000011011100110101101001011100111101111101111011000101111101010101100011001100111101111101001111001111011101011101100001010110111101100100001101011101111100110101101001011100111101111101111011000110111101011100010111001001101000010 e9ad9aec9e95eb9e9ce99f8beabb8be38f8fefa6a4eab38ceb9e9ce99db7ec80a6eab686e6b4b9efbd8beab199efa79eebb0adec86bbe6b4b9efbd8deb8b9342
UHC 魚잕랜韋껋㏏捻곌랜靷쀦궆洹k걙吏밭솻洹m닓B 11100101111000001001111111101010101101111010001111101010110111111000001111101100101001111011100111100110111101111011000011101010101101111010001111101100111001101001011111100110100000101001111111101010101101111010001111101011100000011000001111101100101001111011100111100111100110011011000011101010101101111010001111101101100010001001011101000010 e5e09feab7a3eadf83eca7b9e6f7b0eab7a3ece697e6829feab7a3eb8183eca7b9e799b0eab7a3ed889742

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
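
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_to_bits and the CHARSETS mapping are hypothetical, and the Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumed equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "魚"  # any short string works here
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(label, bits, hexstr)
```

For the single character 魚, this sketch yields 8b9b under cp932, b5fb under euc_jp, and e9ad9a under utf-8, which agrees with the leading bytes of the corresponding rows in the table.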