To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Cx}?????????Cx{^ 00111111001111110011111100111111001111110011111100111111001111110011111101000011011110000111110100111111001111110011111100111111001111110011111100111111001111110011111101000011011110000111101101011110 3f3f3f3f3f3f3f3f3f43787d3f3f3f3f3f3f3f3f3f43787b5e
SJIS-WIN 晤??汝??誼??Cx}晤??汝??誼??Cx{^ 10011101111010110011111100111111100100111111000000111111001111111000101101100010001111110011111101000011011110000111110110011101111010110011111100111111100100111111000000111111001111111000101101100010001111110011111101000011011110000111101101011110 9deb3f3f93f03f3f8b623f3f43787d9deb3f3f93f03f3f8b623f3f43787b5e
EUC-JP 晤??汝??誼??Cx}晤??汝??誼??Cx{^ 11011010111011010011111100111111110001101111001000111111001111111011010111000011001111110011111101000011011110000111110111011010111011010011111100111111110001101111001000111111001111111011010111000011001111110011111101000011011110000111101101011110 daed3f3fc6f23f3fb5c33f3f43787ddaed3f3fc6f23f3fb5c33f3f43787b5e
UTF-8 晤몄씖汝낆씤誼뽰룣Cx}晤몄씖汝낆씤誼뽰룣Cx{^ 11100110100110011010010011101011101010101000010011101100100101001001011011100110101100011001110111101011100000101000011011101100100101001010010011101000101010101011110011101011101111011011000011101011101000111010001101000011011110000111110111100110100110011010010011101011101010101000010011101100100101001001011011100110101100011001110111101011100000101000011011101100100101001010010011101000101010101011110011101011101111011011000011101011101000111010001101000011011110000111101101011110 e699a4ebaa84ec9496e6b19deb8286ec94a4e8aabcebbdb0eba3a343787de699a4ebaa84ec9496e6b19deb8286ec94a4e8aabcebbdb0eba3a343787b5e
UHC 晤몄씖汝낆씤誼뽰룣Cx}晤몄씖汝낆씤誼뽰룣Cx{^ 11100111111110111011100011101100100111011010101111100110101000111000010111101100100111011011100011101011111111101001011011101100100011111001110001000011011110000111110111100111111110111011100011101100100111011010101111100110101000111000010111101100100111011011100011101011111111101001011011101100100011111001110001000011011110000111101101011110 e7fbb8ec9dabe6a385ec9db8ebfe96ec8f9c43787de7fbb8ec9dabe6a385ec9db8ebfe96ec8f9c43787b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)