To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汚??鈺??彦??節??譯?????齬?? 100010011001100000111111001111111111101111000100001111110011111110010101010001100011111100111111100100001101111100111111001111111110011010100001001111110011111100111111001111110011111111101010100101110011111100111111 89983f3ffbc43f3f95463f3f90df3f3fe6a13f3f3f3f3fea973f3f
EUC-JP 汚??鈺??彦??節??譯?????齬?? 10110001111110000011111100111111100011111110001111010101001111110011111111001001101001110011111100111111110000001110000100111111001111111110110010100011001111110011111100111111001111110011111111110011111101110011111100111111 b1f83f3f8fe3d53f3fc9a73f3fc0e13f3feca33f3f3f3f3ff3f73f3f
UTF-8 汚억슬鈺싨퍜彦룩읈節썼춾譯됵슴咽뀐쉭齬뱄슴 111001101011000110011010111011001001011010110101111011001000101010101100111010011000100010111010111011001000101110101000111011011000110110011100111001011011110110100110111010111010001110101001111011001001110110001000111001111010111110000000111011001000110110111100111011001011011010111110111010001010110110101111111010111001000010110101111011001000101010110100111011111010011010011110111010111000000010010000111011001000100110101101111010011011110110101100111010111011000110000100111011001000101010110100 e6b19aec96b5ec8aace988baec8ba8ed8d9ce5bda6eba3a9ec9d88e7af80ec8dbcecb6bee8adafeb90b5ec8ab4efa69eeb8090ec89ade9bdacebb184ec8ab4
UHC 汚억슬鈺싨퍜彦룩읈節썼춾譯됵슴咽뀐쉭齬뱄슴 111001111111110110111110111011111011110110111101111010001010110110011010111001101011101110010011111001011110100110110111111010001001111110111110111011111011110110111101111010001010110110011010111001101011101110001001111011111011110110111111111001101110110010110010111011111011110110101101111001011110000110111001111011111011110110111111 e7fdbeefbdbde8ad9ae6bb93e5e9b7e89fbeefbdbde8ad9ae6bb89efbdbfe6ecb2efbdade5e1b9efbdbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)