To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???唯??碎??瑤?????淞る?語≪?B 0011111100111111001111111001011101000010001111110011111111100001111010100011111100111111111010101010001000111111001111110011111100111111001111111001111111000010100000101110100100111111100011001110101010000001111000010011111101000010 3f3f3f97423f3fe1ea3f3feaa23f3f3f3f3f9fc282e93f8cea81e13f42
EUC-JP ???唯??碎??瑤?????淞る?語≪?B 0011111100111111001111111100110110100011001111110011111111100010111011000011111100111111111101001010010000111111001111110011111100111111001111111101111011000100101001001110101100111111101110001110110010100010111000110011111101000010 3f3f3fcda33f3fe2ec3f3ff4a43f3f3f3f3fdec4a4eb3fb8eca2e33f42
UTF-8 嶺뚢돦唯쎽튃碎쇈럶瑤녠퉫藺쇘춯淞る닔語≪퀗B 11101111101001101010101111101011100110101010001011101011100011111010011011100101100101001010111111101100100011101011110111101101100010101000001111100111101000101000111011101100100001111000100011101011100111111011011011100111100100011010010011101011100001011010000011101101100010011010101111101111101001111011000011101100100001111001100011101100101101101010111111100110101101111001111011100011100000101000101111101011100010111001010011101000101010101001111011100010100010011010101011101101100000001001011101000010 efa6abeb9aa2eb8fa6e594afec8ebded8a83e7a28eec8788eb9fb6e791a4eb85a0ed89abefa7b0ec8798ecb6afe6b79ee3828beb8b94e8aa9ee289aaed809742
UHC 嶺뚢돦唯쎽튃碎쇈럶瑤녠퉫藺쇘춯淞る닔語≪퀗B 11100111101011011000110011100010100010011010101011101010111001101001101111100100101110011001100111100001111011111011110011100011100011101001010111101000111111011011001111101010101110011000001111101100111000011011110011100111101011011000110011100001111001111010101011101011100010001001100011100101110111101010000111101100101100111000110001000010 e7ad8ce289aaeae69be4b999e1efbce38e95e8fdb3eab983ece1bce7ad8ce1e7aaeb8898e5dea1ecb38c42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
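
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and encode_table. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the 3f bytes visible in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("るB").items():
        print(name, bits, hex_string)
```

Each row of the result pairs a binary string with its hexadecimal form, so the two columns always describe the same bytes. Passing errors="strict" instead would raise UnicodeEncodeError for characters the charset cannot represent, rather than substituting "?".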