To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 汚??野??巍??[汚??野??巍??[^ 100010011001100000111111001111111001011011101100001111110011111110011011110110010011111100111111010110111000100110011000001111110011111110010110111011000011111100111111100110111101100100111111001111110101101101011110 89983f3f96ec3f3f9bd93f3f5b89983f3f96ec3f3f9bd93f3f5b5e
EUC-JP 汚??野??巍??[汚??野??巍??[^ 101100011111100000111111001111111100110011101110001111110011111111010110110110110011111100111111010110111011000111111000001111110011111111001100111011100011111100111111110101101101101100111111001111110101101101011110 b1f83f3fccee3f3fd6db3f3f5bb1f83f3fccee3f3fd6db3f3f5b5e
UTF-8 汚좈냽野붻윝巍띶캈[汚좈냽野붻윝巍띶캈[^ 111001101011000110011010111011001010001010001000111010111000001110111101111010011000011110001110111010111011011010111011111011001001110010011101111001011011011110001101111010111001110110110110111011001011101010001000010110111110011010110001100110101110110010100010100010001110101110000011101111011110100110000111100011101110101110110110101110111110110010011100100111011110010110110111100011011110101110011101101101101110110010111010100010000101101101011110 e6b19aeca288eb83bde9878eebb6bbec9c9de5b78deb9db6ecba885be6b19aeca288eb83bde9878eebb6bbec9c9de5b78deb9db6ecba885b5e
UHC 汚좈냽野붻윝巍띶캈[汚좈냽野붻윝巍띶캈[^ 111001111111110110100000111010011000011010001101111001011010111110010100111010001001111110100000111010001110010010001101111001011010111110010100010110111110011111111101101000001110100110000110100011011110010110101111100101001110100010011111101000001110100011100100100011011110010110101111100101000101101101011110 e7fda0e9868de5af94e89fa0e8e48de5af945be7fda0e9868de5af94e89fa0e8e48de5af945b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)