To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 跌ク譌ヲ陲也矯襄乗據跌ク譌ヲ陲也矯襄乗據B 11100110111010011011100011100110100101111010011011101000101000101001011011100111100010111011100011100101111101011000111111100110100111011001111111100110111010011011100011100110100101111010011011101000101000101001011011100111100010111011100011100101111101011000111111100110100111011001111101000010 e6e9b8e697a6e8a296e78bb8e5f58fe69d9fe6e9b8e697a6e8a296e78bb8e5f58fe69d9f42
EUC-JP 跌ク譌ヲ陲也矯襄乗據跌ク譌ヲ陲也矯襄乗據B 1110110011101011100011101011100011101011111101111000111010100110111100001010010011001100111010011011011010111010111010101111011110111110111010001101101010100001111011001110101110001110101110001110101111110111100011101010011011110000101001001100110011101001101101101011101011101010111101111011111011101000110110101010000101000010 eceb8eb8ebf78ea6f0a4cce9b6baeaf7bee8daa1eceb8eb8ebf78ea6f0a4cce9b6baeaf7bee8daa142
UTF-8 跌ク譌ヲ陲也矯襄乗據跌ク譌ヲ陲也矯襄乗據B 11101000101101111000110011101111101111011011100011101000101011011000110011101111101111011010011011101001100110011011001011100100101110011001111111100111100111111010111111101000101001011000010011100100101110011001011111100110100100111001101011101000101101111000110011101111101111011011100011101000101011011000110011101111101111011010011011101001100110011011001011100100101110011001111111100111100111111010111111101000101001011000010011100100101110011001011111100110100100111001101001000010 e8b78cefbdb8e8ad8cefbda6e999b2e4b99fe79fafe8a584e4b997e6939ae8b78cefbdb8e8ad8cefbda6e999b2e4b99fe79fafe8a584e4b997e6939a42
UHC 跌????也矯襄?據跌????也矯襄?據B 11110010111101100011111100111111001111110011111111100101101001011100111011101100111001011101000100111111110010111110000011110010111101100011111100111111001111110011111111100101101001011100111011101100111001011101000100111111110010111110000001000010 f2f63f3f3f3fe5a5ceece5d13fcbe0f2f63f3f3f3fe5a5ceece5d13fcbe042
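A table like the one above can be approximated offline with a short script. The following is a minimal sketch, not the converter's actual implementation. It assumes that Python's `cp932` and `cp949` codecs are close enough stand-ins for SJIS-WIN and UHC, and that unencodable characters are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 and UHC rows. The names `CODECS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# cp932 and cp949 are stand-ins for SJIS-WIN and UHC; a few code points
# may map differently than in the original converter.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("B").items():
        print(label, bits, hexstr)
```

For the ASCII letter `B`, every charset in the table yields the same single byte `42`, which is why each row above ends in `01000010` / `42`.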

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).