Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 迢ク譚大ア樒矯譚大ア杣迢ク譚大ア樒矯譚大ア杣^ 111001111000101110111000111001101001110110010001111001011011000110011110111001111000101110111000111001101001110110010001111001011011000110011110010110111110011110001011101110001110011010011101100100011110010110110001100111101110011110001011101110001110011010011101100100011110010110110001100111100101101101011110 e78bb8e69d91e5b19ee78bb8e69d91e5b19e5be78bb8e69d91e5b19ee78bb8e69d91e5b19e5b5e
EUC-JP 迢ク譚大ア樒矯譚大ア杣迢ク譚大ア樒矯譚大ア杣^ 111011011110101110001110101110001110101111111101110000101110011110001110101100011101110011101001101101101011101011101011111111011100001011100111100011101011000111011011101111001110110111101011100011101011100011101011111111011100001011100111100011101011000111011100111010011011011010111010111010111111110111000010111001111000111010110001110110111011110001011110 edeb8eb8ebfdc2e78eb1dce9b6baebfdc2e78eb1dbbcedeb8eb8ebfdc2e78eb1dce9b6baebfdc2e78eb1dbbc5e
UTF-8 迢ク譚大ア樒矯譚大ア杣迢ク譚大ア樒矯譚大ア杣^ 11101000101111111010001011101111101111011011100011101000101011011001101011100101101001001010011111101111101111011011000111100110101010001001001011100111100111111010111111101000101011011001101011100101101001001010011111101111101111011011000111100110100111011010001111101000101111111010001011101111101111011011100011101000101011011001101011100101101001001010011111101111101111011011000111100110101010001001001011100111100111111010111111101000101011011001101011100101101001001010011111101111101111011011000111100110100111011010001101011110 e8bfa2efbdb8e8ad9ae5a4a7efbdb1e6a892e79fafe8ad9ae5a4a7efbdb1e69da3e8bfa2efbdb8e8ad9ae5a4a7efbdb1e6a892e79fafe8ad9ae5a4a7efbdb1e69da35e
UHC ??譚大??矯譚大????譚大??矯譚大??^ 001111110011111111010011110010011101001111011110001111110011111111001110111011001101001111001001110100111101111000111111001111110011111100111111110100111100100111010011110111100011111100111111110011101110110011010011110010011101001111011110001111110011111101011110 3f3fd3c9d3de3f3fceecd3c9d3de3f3f3f3fd3c9d3de3f3fceecd3c9d3de3f3f5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
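
The conversion shown in the table can be roughly reproduced offline. The following is a minimal Python sketch, not the code behind this page. The names `CODECS`, `encode_to_bits` and `conversion_table` are hypothetical, and the mapping of the table's charset labels to Python codec names (for example SJIS-Win to `cp932`, UHC to `cp949`) is an assumption; Python's codecs may differ from this page's converter for some characters.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def conversion_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in conversion_table("あ^").items():
        print(label, binary, hexadecimal)
```

With `errors="replace"`, a character that a charset cannot represent becomes `?` (byte `3f`), which is consistent with the runs of `3f` in the ISO-8859-1 and UHC rows above.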