To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 闇γ????音??n}闇γ????音??n{^ 1000100011000101100000111100000100111111001111110011111100111111100010011011100100111111001111110110111001111101100010001100010110000011110000010011111100111111001111110011111110001001101110010011111100111111011011100111101101011110 88c583c13f3f3f3f89b93f3f6e7d88c583c13f3f3f3f89b93f3f6e7b5e
EUC-JP 闇γ?靷??音??n}闇γ?靷??音??n{^ 101100001100011110100110110000110011111110001111111001111011110100111111001111111011001010111011001111110011111101101110011111011011000011000111101001101100001100111111100011111110011110111101001111110011111110110010101110110011111100111111011011100111101101011110 b0c7a6c33f8fe7bd3f3fb2bb3f3f6e7db0c7a6c33f8fe7bd3f3fb2bb3f3f6e7b5e
UTF-8 闇γ룢靷뗥♤音붵뀅n}闇γ룢靷뗥♤音붵뀅n{^ 111010011001011110000111110011101011001111101011101000111010001011101001100111011011011111101011100101111010010111100010100110011010010011101001100111111011001111101011101101101011010111101011100000001000010101101110011111011110100110010111100001111100111010110011111010111010001110100010111010011001110110110111111010111001011110100101111000101001100110100100111010011001111110110011111010111011011010110101111010111000000010000101011011100111101101011110 e99787ceb3eba3a2e99db7eb97a5e299a4e99fb3ebb6b5eb80856e7de99787ceb3eba3a2e99db7eb97a5e299a4e99fb3ebb6b5eb80856e7b5e
UHC 闇γ룢靷뗥♤音붵뀅n}闇γ룢靷뗥♤音붵뀅n{^ 1110010011100001101001011110001110001111100110111110110011100110100010111110010110100010101110111110101111100101100101001110001110000101100000010110111001111101111001001110000110100101111000111000111110011011111011001110011010001011111001011010001010111011111010111110010110010100111000111000010110000001011011100111101101011110 e4e1a5e38f9bece68be5a2bbebe594e385816e7de4e1a5e38f9bece68be5a2bbebe594e385816e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)