To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????M}?????????M{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100110101111101001111110011111100111111001111110011111100111111001111110011111100111111010011010111101101011110 3f3f3f3f3f3f3f3f3f4d7d3f3f3f3f3f3f3f3f3f4d7b5e
SJIS-WIN 蒡湲伎ス堺釗ム危┳M}蒡湲伎ス堺釗ム危┳M{^ 11100100111011101001111111010001100010101110101010111101100011011110010011111011101110111101000110001010111010111000010010110001010011010111110111100100111011101001111111010001100010101110101010111101100011011110010011111011101110111101000110001010111010111000010010110001010011010111101101011110 e4ee9fd18aeabd8de4fbbbd18aeb84b14d7de4ee9fd18aeabd8de4fbbbd18aeb84b14d7b5e
EUC-JP 蒡湲伎ス堺釗ム危┳M}蒡湲伎ス堺釗ム危┳M{^ 11101000111100001101111011010011101101001110110010001110101111011011101011100110100011111110001110100110100011101101000110110100111011011010100010110011010011010111110111101000111100001101111011010011101101001110110010001110101111011011101011100110100011111110001110100110100011101101000110110100111011011010100010110011010011010111101101011110 e8f0ded3b4ec8ebdbae68fe3a68ed1b4eda8b34d7de8f0ded3b4ec8ebdbae68fe3a68ed1b4eda8b34d7b5e
UTF-8 蒡湲伎ス堺釗ム危┳M}蒡湲伎ス堺釗ム危┳M{^ 1110100010010010101000011110011010111001101100101110010010111100100011101110111110111101101111011110010110100000101110101110100110000111100101111110111110111110100100011110010110001101101100011110001010010100101100110100110101111101111010001001001010100001111001101011100110110010111001001011110010001110111011111011110110111101111001011010000010111010111010011000011110010111111011111011111010010001111001011000110110110001111000101001010010110011010011010111101101011110 e892a1e6b9b2e4bc8eefbdbde5a0bae98797efbe91e58db1e294b34d7de892a1e6b9b2e4bc8eefbdbde5a0bae98797efbe91e58db1e294b34d7b5e
UHC 蒡湲伎?堺釗?危┳M}蒡湲伎?堺釗?危┳M{^ 11011011101111001110101010111000110100001110101100111111110011001111011111100001111100100011111111101010110010111010011010110011010011010111110111011011101111001110101010111000110100001110101100111111110011001111011111100001111100100011111111101010110010111010011010110011010011010111101101011110 dbbceab8d0eb3fccf7e1f23feacba6b34d7ddbbceab8d0eb3fccf7e1f23feacba6b34d7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)