To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN ?k?循??筍ル?h?k?循??筍ル? 001111111000001010001011001111111000111101111010001111110011111111100010101000011000001110001011001111110110100000111111100000101000101100111111100011110111101000111111001111111110001010100001100000111000101100111111 3f828b3f8f7a3f3fe2a1838b3f683f828b3f8f7a3f3fe2a1838b3f
EUC-JP 渶k?循??筍ル?h渶k?循??筍ル? 10001111110001111110110110100011111010110011111110111101110110110011111100111111111001001010001110100101111010110011111101101000100011111100011111101101101000111110101100111111101111011101101100111111001111111110010010100011101001011110101100111111 8fc7eda3eb3fbddb3f3fe4a3a5eb3f688fc7eda3eb3fbddb3f3fe4a3a5eb3f
UTF-8 渶k겳循븝쫵筍ル섶h渶k겳循븝쫵筍ル섶 11100110101110001011011011101111101111011000101111101010101100101011001111100101101111101010101011101011101110001001110111101100101010111011010111100111101011011000110111100011100000111010101111101100100001001011011001101000111001101011100010110110111011111011110110001011111010101011001010110011111001011011111010101010111010111011100010011101111011001010101110110101111001111010110110001101111000111000001110101011111011001000010010110110 e6b8b6efbd8beab2b3e5beaaebb89decabb5e7ad8de383abec84b668e6b8b6efbd8beab2b3e5beaaebb89decabb5e7ad8de383abec84b6
UHC 渶k겳循븝쫵筍ル섶h渶k겳循븝쫵筍ル섶 11100111101101111010001111101011100000011011111111100010111000001011101011101111101001101000110011100010111011001010101111101011101111001011101101101000111001111011011110100011111010111000000110111111111000101110000010111010111011111010011010001100111000101110110010101011111010111011110010111011 e7b7a3eb81bfe2e0baefa68ce2ecabebbcbb68e7b7a3eb81bfe2e0baefa68ce2ecabebbcbb

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
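
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the helper name encode_row and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions, and Python's codecs may differ from the tool's for some characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), as in the ISO-8859-1 row above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names
# (an assumption; the tool may use different conversion tables).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = encode_row("h", codec)
        print(label, bits, hex_string)
```

Because all five charsets are ASCII-compatible, the letter "h" encodes to the single byte 68 in every row; differences appear only for non-ASCII input.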