To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????ぜ松??榮??議??柔j?沃 0011111100111111001111110011111100111111100000101011101010001111101111000011111100111111100111101100010000111111001111111000101101100011001111110011111110001111010111111000001010001010001111111001011110000000 3f3f3f3f3f82ba8fbc3f3f9ec43f3f8b633f3f8f5f828a3f9780
EUC-JP ???靷?ぜ松??榮??議??柔j?沃 00111111001111110011111110001111111001111011110100111111101001001011110010111110101111100011111100111111110111001100011000111111001111111011010111000100001111110011111110111101110000001010001111101010001111111100110111100000 3f3f3f8fe7bd3fa4bcbebe3f3fdcc63f3fb5c43f3fbdc0a3ea3fcde0
UTF-8 嶺뚮뿫靷뽬ぜ松쎌춷榮붾낌議귞춯柔j틓沃 111011111010011010101011111010111001101010101110111010111011111110101011111010011001110110110111111010111011110110101100111000111000000110011100111001101001110110111110111011001000111010001100111011001011011010110111111001101010011010101110111010111011011010111110111010111000001010001100111010001010110110110000111010101011011110011110111011001011011010101111111001101001111110010100111011111011110110001010111011011000101110010011111001101011001010000011 efa6abeb9aaeebbfabe99db7ebbdace3819ce69dbeec8e8cecb6b7e6a6aeebb6beeb828ce8adb0eab79eecb6afe69f94efbd8aed8b93e6b283
UHC 嶺뚮뿫靷뽬ぜ松쎌춷榮붾낌議귞춯柔j틓沃 1110011110101101100011001110101110010111101010111110110011100110100101101110100010101010101111001110000111100110101111011110110010101101100100111110011110110100100101001110101110110011101001101110110010100001100000101110011110101101100011001110101011110101101000111110101010111010100000101110100010101010 e7ad8ceb97abece696e8aabce1e6bdecad93e7b494ebb3a6eca182e7ad8ceaf5a3eaba82e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)