To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??蹂μ?癲??唯??猿??筌??? 111000011001111100111111001111111000101101011000001111110011111111100110111110001000001111001010001111111110000110011111001111110011111110010111010000100011111100111111100010011000111000111111001111111110001010100011001111110011111100111111 e19f3f3f8b583f3fe6f883ca3fe19f3f3f97423f3f898e3f3fe2a33f3f3f
EUC-JP 癲??宜??蹂μ?癲??唯??猿??筌??沅 1110001010100001001111110011111110110101101110010011111100111111111011001111101010100110110011000011111111100010101000010011111100111111110011011010001100111111001111111011000111101110001111110011111111100100101001010011111100111111100011111100011011101001 e2a13f3fb5b93f3fecfaa6cc3fe2a13f3fcda33f3fb1ee3f3fe4a53f3f8fc6e9
UTF-8 癲됯낱宜쇽쭓蹂μ젫癲딉퐣唯뽫쑴猿낅쭜筌뗭뮁沅 1110011110011001101100101110101110010000101011111110101110000010101100011110010110101110100111001110110010000111101111011110110010101101100100111110100010111001100000101100111010111100111011001010000010101011111001111001100110110010111010111001010010001001111011011001000010100011111001011001010010101111111010111011110110101011111011001001000110110100111001111000110010111111111010111000001010000101111011001010110110011100111001111010110110001100111010111001011110101101111010111010111010000001111001101011001010000101 e799b2eb90afeb82b1e5ae9cec87bdecad93e8b982cebceca0abe799b2eb9489ed90a3e594afebbdabec91b4e78cbfeb8285ecad9ce7ad8ceb97adebae81e6b285
UHC 癲됯낱宜쇽쭓蹂μ젫癲딉퐣唯뽫쑴猿낅쭜筌뗭뮁沅 1110111110100110100010011110101010110011101110011110101111110001101111001110111110100111100010111110101110110011101001011110110010100000101000111110111110100110100010101110111110111101100011001110101011100110100101101110011110111110101010011110101010111011100001011110101110100111100100101110111110100111100010111110110010010010100100001110101010110110 efa689eab3b9ebf1bcefa78bebb3a5eca0a3efa68aefbd8ceae696e7bea9eabb85eba792efa78bec9290eab6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)