To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??肉ヨ?衰??冗??宥??循??沃 100110100110101000111111001111111001001111110111100000111000100000111111100100001000101000111111001111111000111111100111001111110011111110010111010001110011111100111111100011110111101000111111001111111001011110000000 9a6a3f3f93f783883f908a3f3f8fe73f3f97473f3f8f7a3f3f9780
EUC-JP 嗚??肉ヨ?衰??冗??宥??循??沃 110100111100101100111111001111111100011011111001101001011110100000111111101111111110101000111111001111111011111011101001001111110011111111001101101010000011111100111111101111011101101100111111001111111100110111100000 d3cb3f3fc6f9a5e83fbfea3f3fbee93f3fcda83f3fbddb3f3fcde0
UTF-8 嗚삳챶肉ヨ린衰⑸겱冗밴낮宥꿩듉循딆뫅沃 111001011001011110011010111011001000001010110011111011001011000110110110111010001000001010001001111000111000001110101000111010111010011010110000111010001010000110110000111000101001000110111000111010101011001010110001111001011000011010010111111010111011000010110100111010111000001010101110111001011010111010100101111010101011111110101001111010111001001110001001111001011011111010101010111010111001010010000110111010111010101110000101111001101011001010000011 e5979aec82b3ecb1b6e88289e383a8eba6b0e8a1b0e291b8eab2b1e58697ebb0b4eb82aee5aea5eabfa9eb9389e5beaaeb9486ebab85e6b283
UHC 嗚삳챶肉ヨ린衰⑸겱冗밴낮宥꿩듉循딆뫅沃 1110011111110000101110111110101110101010100000111110101110111111101010111110100010111000101100001110000111110001101010011110101110000001101111011110100110110111101110011110101010110011101101111110101011101001101100101110011010001010101111001110001011100000100010101110110010010001101010001110100010101010 e7f0bbebaa83ebbfabe8b8b0e1f1a9eb81bde9b7b9eab3b7eae9b2e68abce2e08aec91a8e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)