To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???夜??弱?【節ョア節ヨ?節ワ?^ 0011111100111111001111111001011011101001001111110011111110001110111000110011111110000001011110011001000011011111100000111000011110000011010000011001000011011111100000111000100000111111100100001101111110000011100011110011111101011110 3f3f3f96e93f3f8ee33f817990df8387834190df83883f90df838f3f5e
EUC-JP ???夜??弱?【節ョア節ヨ?節ワ?^ 0011111100111111001111111100110011101011001111110011111110111100111001010011111110100001110110101100000011100001101001011110011110100101101000101100000011100001101001011110100000111111110000001110000110100101111011110011111101011110 3f3f3fcceb3f3fbce53fa1dac0e1a5e7a5a2c0e1a5e83fc0e1a5ef3f5e
UTF-8 若듸슈夜쇽쉼弱녺【節ョア節ヨ뱰節ワ숲^ 11101111101001011011010011101011100100111011100011101100100010101000100011100101101001001001110011101100100001111011110111101100100010011011110011100101101111001011000111101011100001011011101011100011100000001001000011100111101011111000000011100011100000111010011111100011100000101010001011100111101011111000000011100011100000111010100011101011101100011011000011100111101011111000000011100011100000111010111111101100100010001011001001011110 efa5b4eb93b8ec8a88e5a49cec87bdec89bce5bcb1eb85bae38090e7af80e383a7e382a2e7af80e383a8ebb1b0e7af80e383afec88b25e
UHC 若듸슈夜쇽쉼弱녺【節ョア節ヨ뱰節ワ숲^ 11100101101011101011010111101111101111011011010011100101101010001011110011101111101111011011000011100101101100001000011011100111101000011011110011101111101111011010101111100111101010111010001011101111101111011010101111101000100100111001011011101111101111011010101111101111101111011010001101011110 e5aeb5efbdb4e5a8bcefbdb0e5b086e7a1bcefbdabe7aba2efbdabe89396efbdabefbda35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)