To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?閑荊?枇?緬ぇ垈^ 00111111100010101101010110001100011101000011111110010100111110000011111110010110110010011000001010100101100110101011000001011110 3f8ad58c743f94f83f96c982a59ab05e
EUC-JP ?閑荊?枇?緬ぇ垈^ 00111111101101001101011110110111110101010011111111001000111110100011111111001100110010111010010010100111110101001011001001011110 3fb4d7b7d53fc8fa3fcccba4a7d4b25e
UTF-8 뤋閑荊㎿枇샘緬ぇ垈^ 11101011101001001000101111101001100101101001000111101000100011011000101011100011100011101011111111100110100111101000011111101100100000111001100011100111101101111010110011100011100000011000011111100101100111101000100001011110 eba48be99691e88d8ae38ebfe69e87ec8398e7b7ace38187e59e885e
UHC 뤋閑荊㎿枇샘緬ぇ垈^ 10001111101110111111100111011000111110111010101010100111110100111101110111101101101110111111100111011000111110111010101010100111110100111101110001011110 8fbbf9d8fbaaa7d3ddedbbf9d8fbaaa7d3dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)