To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????h?????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110110100000111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f
SJIS-WIN 迢ク螻樣●迢ク譌丞穀h迢ク螻樣●迢ク譌丞穀 11100111100010111011100011100101101100011001111011101001100000011001110011100111100010111011100011100110100101111000111111100101100011011001001001101000111001111000101110111000111001011011000110011110111010011000000110011100111001111000101110111000111001101001011110001111111001011000110110010010 e78bb8e5b19ee9819ce78bb8e6978fe58d9268e78bb8e5b19ee9819ce78bb8e6978fe58d92
EUC-JP 迢ク螻樣●迢ク譌丞穀h迢ク螻樣●迢ク譌丞穀 1110110111101011100011101011100011101010101100111101110011101011101000011111110011101101111010111000111010111000111010111111011110111110111001111011100111110010011010001110110111101011100011101011100011101010101100111101110011101011101000011111110011101101111010111000111010111000111010111111011110111110111001111011100111110010 edeb8eb8eab3dceba1fcedeb8eb8ebf7bee7b9f268edeb8eb8eab3dceba1fcedeb8eb8ebf7bee7b9f2
UTF-8 迢ク螻樣●迢ク譌丞穀h迢ク螻樣●迢ク譌丞穀 11101000101111111010001011101111101111011011100011101000100111101011101111100110101010001010001111100010100101111000111111101000101111111010001011101111101111011011100011101000101011011000110011100100101110001001111011100111101010011000000001101000111010001011111110100010111011111011110110111000111010001001111010111011111001101010100010100011111000101001011110001111111010001011111110100010111011111011110110111000111010001010110110001100111001001011100010011110111001111010100110000000 e8bfa2efbdb8e89ebbe6a8a3e2978fe8bfa2efbdb8e8ad8ce4b89ee7a98068e8bfa2efbdb8e89ebbe6a8a3e2978fe8bfa2efbdb8e8ad8ce4b89ee7a980
UHC ???樣●???丞穀h???樣●???丞穀 0011111100111111001111111110010111000110101000011101110000111111001111110011111111100011101010101100110111011010011010000011111100111111001111111110010111000110101000011101110000111111001111110011111111100011101010101100110111011010 3f3f3fe5c6a1dc3f3f3fe3aacdda683f3f3fe5c6a1dc3f3f3fe3aacdda

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
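
A conversion like the one in the table above can be approximated in a few lines of Python. This is only a minimal sketch, not the tool's actual implementation: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration, and Python's codecs may differ from the tool's converter for some characters. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` runs in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return one (charset, character, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Decode again to show what survived the round trip.
        shown = data.decode(codec, errors="replace")
        # Eight binary digits per byte, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("h"):
        print(*row)
```

Calling `encode_table` with a non-ASCII string shows how the same character maps to different byte sequences, or to `?`, depending on the charset.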