To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲レ?乙??畏?????蹂l?繹??有 111000011001111110000011100011000011111110001001101100110011111100111111100010001101100000111111001111110011111100111111001111111110011011111000100000101000110000111111111000111000100000111111001111111001011101001100 e19f838c3f89b33f3f88d83f3f3f3f3fe6f8828c3fe3883f3f974c
EUC-JP 癲レ?乙??畏?????蹂l?繹??有 111000101010000110100101111011000011111110110010101101010011111100111111101100001101101000111111001111110011111100111111001111111110110011111010101000111110110000111111111001011110100000111111001111111100110110101101 e2a1a5ec3fb2b53f3fb0da3f3f3f3f3fecfaa3ec3fe5e83f3fcdad
UTF-8 癲レ떥乙대쿉畏브퀡履뗥퐲蹂l쑚繹먭퍗有 111001111001100110110010111000111000001110101100111010111001011010100101111001001011100110011001111010111000110010000000111011001011111110001001111001111001010110001111111010111011100010001100111011011000000010100001111011111010011110011111111010111001011110100101111011011001000010110010111010001011100110000010111011111011110110001100111011001001000110011010111001111011100110111001111010111010100010101101111011011000110110010111111001101001110010001001 e799b2e383aceb96a5e4b999eb8c80ecbf89e7958febb88ced80a1efa79feb97a5ed90b2e8b982efbd8cec919ae7b9b9eba8aded8d97e69c89
UHC 癲レ떥乙대쿉畏브퀡履뗥퐲蹂l쑚繹먭퍗有 1110111110100110101010111110110010001011101110001110101111100000101101001110101110110010100111101110100011100110101110101110101010110011100101011110110010101010100010111110010110111101100110111110101110110011101000111110110010011100101110011110011010111010100100001110101010111011100011101110101011110011 efa6abec8bb8ebe0b4ebb29ee8e6baeab395ecaa8be5bd9bebb3a3ec9cb9e6ba90eabb8eeaf3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)