To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 僥??榮??將??鸚??絶?ぜ節??謠?? 1001100101000110001111110011111110011110110001000011111100111111100110111001001000111111001111111110101001011111001111110011111110010000111000100011111110000010101110101001000011011111001111110011111111100110100011110011111100111111 99463f3f9ec43f3f9b923f3fea5f3f3f90e23f82ba90df3f3fe68f3f3f
EUC-JP 僥??榮??將??鸚??絶?ぜ節??謠?? 1101000110100111001111110011111111011100110001100011111100111111110101011111001000111111001111111111001111000000001111110011111111000000111001000011111110100100101111001100000011100001001111110011111111101011111011110011111100111111 d1a73f3fdcc63f3fd5f23f3ff3c03f3fc0e43fa4bcc0e13f3febef3f3f
UTF-8 僥뺜툋榮쀯슐將됵슬鸚까겍絶쒒ぜ節몌숱謠쇽풛 111001011000001110100101111010111011101010011100111011011000100010001011111001101010011010101110111011001000000010101111111011001000101010010000111001011011000010000111111010111001000010110101111011001000101010101100111010011011100010011010111010101011100110001100111010101011001010001101111001111011010110110110111011001001001010010010111000111000000110011100111001111010111110000000111010111010101010001100111011001000100010110001111010001010110010100000111011001000011110111101111011011001001010011011 e583a5ebba9ced888be6a6aeec80afec8a90e5b087eb90b5ec8aace9b89aeab98ceab28de7b5b6ec9292e3819ce7af80ebaa8cec88b1e8aca0ec87bded929b
UHC 僥뺜툋榮쀯슐將됵슬鸚까겍絶쒒ぜ節몌숱謠쇽풛 111010001110100110010101111001001011100010000011111001111011010010010111111011111011110110110110111011011110001010001001111011111011110110111101111001011010010010110001111011101000000110100110111011111011111010011100111010011010101010111100111011111011110110111000111011111011110110100010111010011010101010111100111011111011111010011110 e8e995e4b883e7b497efbdb6ede289efbdbde5a4b1ee81a6efbe9ce9aabcefbdb8efbda2e9aabcefbe9e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)