What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???夜??弱?ア節ョア節ヨ?節ワ?^ 0011111100111111001111111001011011101001001111110011111110001110111000110011111110000011010000011001000011011111100000111000011110000011010000011001000011011111100000111000100000111111100100001101111110000011100011110011111101011110 3f3f3f96e93f3f8ee33f834190df8387834190df83883f90df838f3f5e
EUC-JP ???夜??弱?ア節ョア節ヨ?節ワ?^ 0011111100111111001111111100110011101011001111110011111110111100111001010011111110100101101000101100000011100001101001011110011110100101101000101100000011100001101001011110100000111111110000001110000110100101111011110011111101011110 3f3f3fcceb3f3fbce53fa5a2c0e1a5e7a5a2c0e1a5e83fc0e1a5ef3f5e
UTF-8 若듸슈夜쇽쉼弱녺ア節ョア節ヨ쑠節ワ숲^ 11101111101001011011010011101011100100111011100011101100100010101000100011100101101001001001110011101100100001111011110111101100100010011011110011100101101111001011000111101011100001011011101011100011100000101010001011100111101011111000000011100011100000111010011111100011100000101010001011100111101011111000000011100011100000111010100011101100100100011010000011100111101011111000000011100011100000111010111111101100100010001011001001011110 efa5b4eb93b8ec8a88e5a49cec87bdec89bce5bcb1eb85bae382a2e7af80e383a7e382a2e7af80e383a8ec91a0e7af80e383afec88b25e
UHC 若듸슈夜쇽쉼弱녺ア節ョア節ヨ쑠節ワ숲^ 11100101101011101011010111101111101111011011010011100101101010001011110011101111101111011011000011100101101100001000011011100111101010111010001011101111101111011010101111100111101010111010001011101111101111011010101111101000100111001011111111101111101111011010101111101111101111011010001101011110 e5aeb5efbdb4e5a8bcefbdb0e5b086e7aba2efbdabe7aba2efbdabe89cbfefbdabefbda35e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
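
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the helper name `to_bits` is hypothetical, the codec names `cp932`, `euc_jp` and `cp949` are Python's names for SJIS-Win, EUC-JP and UHC, and `errors="replace"` is an assumption chosen because the table shows `?` (3f) wherever a character cannot be represented in the target charset.

```python
def to_bits(text, charset):
    """Encode text in the given charset and return (binary string, hex string).

    Characters the charset cannot represent are replaced with '?'
    (an assumption that mirrors the 3f bytes in the table above).
    """
    data = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    # Python codec names assumed to correspond to the charsets in the table.
    for charset in ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949"):
        bits, hexa = to_bits("夜^", charset)
        print(charset, bits, hexa)
```

For the character 夜 this gives 96e9 in cp932, cceb in euc_jp and e5a49c in utf-8, which are the byte values that appear for 夜 in the SJIS-WIN and EUC-JP rows above and its standard UTF-8 encoding. In iso-8859-1 the character is not representable and becomes 3f.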