To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??猷??扱猷??袁??厄???亦??? 11100001100111110011111100111111100101110101000100111111001111111000100010110101100101110101000100111111001111111110010111001101001111110011111110010110111011110011111100111111001111111001011010010010001111110011111100111111 e19f3f3f97513f3f88b597513f3fe5cd3f3f96ef3f3f3f96923f3f3f
EUC-JP 癲??猷??扱猷??袁??厄???亦??沅 111000101010000100111111001111111100110110110010001111110011111110110000101101111100110110110010001111110011111111101010110011110011111100111111110011001111000100111111001111110011111111001011111100100011111100111111100011111100011011101001 e2a13f3fcdb23f3fb0b7cdb23f3feacf3f3fccf13f3f3fcbf23f3f8fc6e9
UTF-8 癲놁엱猷볩쭒扱猷싮넫袁⑹몥厄닌됱챻亦뱀늸沅 111001111001100110110010111010111000011010000001111011001001011110110001111001111000110010110111111010111011001110101001111011001010110110010010111001101000100110110001111001111000110010110111111011001000101110101110111010111000010010101011111010001010001010000001111000101001000110111001111010111010101010100101111001011000111010000100111010111000101110001100111010111001000010110001111011001011000110111011111001001011101010100110111010111011000110000000111010111000101010111000111001101011001010000101 e799b2eb8681ec97b1e78cb7ebb3a9ecad92e689b1e78cb7ec8baeeb84abe8a281e291b9ebaaa5e58e84eb8b8ceb90b1ecb1bbe4baa6ebb180eb8ab8e6b285
UHC 癲놁엱猷볩쭒扱猷싮넫袁⑹몥厄닌됱챻亦뱀늸沅 111011111010011010000110111011001001111010000110111010111010001110010011111011111010011110001010110100001110001011101011101000111001101011101001100001101010101111101010101111101010100111101100100100011001001111100100111110001011010011010001100010011110110010101010100010001110011010110010101110011110110010001000100000011110101010110110 efa686ec9e86eba393efa78ad0e2eba39ae986abeabea9ec9193e4f8b4d189ecaa88e6b2b9ec8881eab6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)