To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 汚??鈺??彦??節??岳??B 100010011001100000111111001111111111101111000100001111110011111110010101010001100011111100111111100100001101111100111111001111111000101001111000001111110011111101000010 89983f3ffbc43f3f95463f3f90df3f3f8a783f3f42
EUC-JP 汚??鈺??彦??節??岳??B 10110001111110000011111100111111100011111110001111010101001111110011111111001001101001110011111100111111110000001110000100111111001111111011001111011001001111110011111101000010 b1f83f3f8fe3d53f3fc9a73f3fc0e13f3fb3d93f3f42
UTF-8 汚억슬鈺싨퍜彦룬뼹節김웶岳뜻뱴B 11100110101100011001101011101100100101101011010111101100100010101010110011101001100010001011101011101100100010111010100011101101100011011001110011100101101111011010011011101011101000111010110011101011101111001011100111100111101011111000000011101010101110011000000011101100100110111011011011100101101100101011001111101011100111001011101111101011101100011011010001000010 e6b19aec96b5ec8aace988baec8ba8ed8d9ce5bda6eba3acebbcb9e7af80eab980ec9bb6e5b2b3eb9cbbebb1b442
UHC 汚억슬鈺싨퍜彦룬뼹節김웶岳뜻뱴B 11100111111111011011111011101111101111011011110111101000101011011001101011100110101110111001001111100101111010011011011111101001100101101011110011101111101111011011000111101000100111111000010011100100101111111011011011100110100100111001101001000010 e7fdbeefbdbde8ad9ae6bb93e5e9b7e996bcefbdb1e89f84e4bfb6e6939a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)