What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 迢ク譌ヲ陲也矯譌乗據迢ク譌ヲ陲也矯譌乗據B 11100111100010111011100011100110100101111010011011101000101000101001011011100111100010111011100011100110100101111000111111100110100111011001111111100111100010111011100011100110100101111010011011101000101000101001011011100111100010111011100011100110100101111000111111100110100111011001111101000010 e78bb8e697a6e8a296e78bb8e6978fe69d9fe78bb8e697a6e8a296e78bb8e6978fe69d9f42
EUC-JP 迢ク譌ヲ陲也矯譌乗據迢ク譌ヲ陲也矯譌乗據B 1110110111101011100011101011100011101011111101111000111010100110111100001010010011001100111010011011011010111010111010111111011110111110111010001101101010100001111011011110101110001110101110001110101111110111100011101010011011110000101001001100110011101001101101101011101011101011111101111011111011101000110110101010000101000010 edeb8eb8ebf78ea6f0a4cce9b6baebf7bee8daa1edeb8eb8ebf78ea6f0a4cce9b6baebf7bee8daa142
UTF-8 迢ク譌ヲ陲也矯譌乗據迢ク譌ヲ陲也矯譌乗據B 11101000101111111010001011101111101111011011100011101000101011011000110011101111101111011010011011101001100110011011001011100100101110011001111111100111100111111010111111101000101011011000110011100100101110011001011111100110100100111001101011101000101111111010001011101111101111011011100011101000101011011000110011101111101111011010011011101001100110011011001011100100101110011001111111100111100111111010111111101000101011011000110011100100101110011001011111100110100100111001101001000010 e8bfa2efbdb8e8ad8cefbda6e999b2e4b99fe79fafe8ad8ce4b997e6939ae8bfa2efbdb8e8ad8cefbda6e999b2e4b99fe79fafe8ad8ce4b997e6939a42
UHC ?????也矯??據?????也矯??據B 001111110011111100111111001111110011111111100101101001011100111011101100001111110011111111001011111000000011111100111111001111110011111100111111111001011010010111001110111011000011111100111111110010111110000001000010 3f3f3f3f3fe5a5ceec3f3fcbe03f3f3f3f3fe5a5ceec3f3fcbe042

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
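
A table like the one above can be reproduced offline with a short script. The following is only a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names is an assumption (in particular, SJIS-WIN is taken to be cp932 and UHC to be cp949), and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# "SJIS-WIN" -> cp932 and "UHC" -> cp949 are assumptions, not confirmed by the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly 8 binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Hiragana "a" followed by the ASCII letter "B".
    for label, (binary, hexadecimal) in encode_table("\u3042B").items():
        print(label, binary, hexadecimal)
```

The ASCII letter "B" comes out as 01000010 (hex 42) in every charset, which is why each row of the table ends with the same eight bits.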