What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 迢ク螻樔サ朴迢ク譌ヲ雉蓋迢ク譌ヲ邯夂矯 1110011110001011101110001110010110110001100111101110010010111011100101100111000011100111100010111011100011100110100101111010011011101000101100111000101001010111111001111000101110111000111001101001011110100110111001111011011010011010111001111000101110111000 e78bb8e5b19ee4bb9670e78bb8e697a6e8b38a57e78bb8e697a6e7b69ae78bb8
EUC-JP 迢ク螻樔サ朴迢ク譌ヲ雉蓋迢ク譌ヲ邯夂矯 1110110111101011100011101011100011101010101100111101110011100110100011101011101111001011110100011110110111101011100011101011100011101011111101111000111010100110111100001011010110110011101110001110110111101011100011101011100011101011111101111000111010100110111011101011100011010100111010011011011010111010 edeb8eb8eab3dce68ebbcbd1edeb8eb8ebf78ea6f0b5b3b8edeb8eb8ebf78ea6eeb8d4e9b6ba
UTF-8 迢ク螻樔サ朴迢ク譌ヲ雉蓋迢ク譌ヲ邯夂矯 111010001011111110100010111011111011110110111000111010001001111010111011111001101010100010010100111011111011110110111011111001101001110010110100111010001011111110100010111011111011110110111000111010001010110110001100111011111011110110100110111010011001101110001001111010001001001110001011111010001011111110100010111011111011110110111000111010001010110110001100111011111011110110100110111010011000001010101111111001011010010010000010111001111001111110101111 e8bfa2efbdb8e89ebbe6a894efbdbbe69cb4e8bfa2efbdb8e8ad8cefbda6e99b89e8938be8bfa2efbdb8e8ad8cefbda6e982afe5a482e79faf
UHC ?????朴????雉蓋????邯?矯 001111110011111100111111001111110011111111011010110100110011111100111111001111110011111111110110110010111100101111001111001111110011111100111111001111111100101011111011001111111100111011101100 3f3f3f3f3fdad33f3f3f3ff6cbcbcf3f3f3f3fcafb3fceec

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
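
A conversion like the one in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The function name `encode_table` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the ISO-8859-1 and UHC rows above.

```python
# Hypothetical sketch: map the table's charset labels to Python codec names.
# The label-to-codec mapping is an assumption, not taken from the tool.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec lacks.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Running the script prints one row per charset for the single character "あ". The same character takes one, two, or three bytes depending on the charset, which is why the bit strings in the table differ in length.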