Which bit string is a character (or short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 迂?蜃?????檢}v迂?蜃?????檢}vB 1000100101001001001111111110010110000111001111110011111100111111001111110011111110011110111110110111110101110110100010010100100100111111111001011000011100111111001111110011111100111111001111111001111011111011011111010111011001000010 89493fe5873f3f3f3f3f9efb7d7689493fe5873f3f3f3f3f9efb7d7642
EUC-JP 迂?蜃?????檢}v迂?蜃?????檢}vB 1011000110101010001111111110100111100111001111110011111100111111001111110011111111011100111111010111110101110110101100011010101000111111111010011110011100111111001111110011111100111111001111111101110011111101011111010111011001000010 b1aa3fe9e73f3f3f3f3fdcfd7d76b1aa3fe9e73f3f3f3f3fdcfd7d7642
UTF-8 迂렧蜃렎렽닻렚븜檢}v迂렧蜃렎렽닻렚븜檢}vB 1110100010111111100000101110101110100000101001111110100010011100100000111110101110100000100011101110101110100000101111011110101110001011101110111110101110100000100110101110101110111000100111001110011010101010101000100111110101110110111010001011111110000010111010111010000010100111111010001001110010000011111010111010000010001110111010111010000010111101111010111000101110111011111010111010000010011010111010111011100010011100111001101010101010100010011111010111011001000010 e8bf82eba0a7e89c83eba08eeba0bdeb8bbbeba09aebb89ce6aaa27d76e8bf82eba0a7e89c83eba08eeba0bdeb8bbbeba09aebb89ce6aaa27d7642
UHC 迂렧蜃렎렽닻렚븜檢}v迂렧蜃렎렽닻렚븜檢}vB 1110100111100110100011101011011011100011111100011000111010100100100011101100010110110100111010011000111010101101101110101110111011001011111111100111110101110110111010011110011010001110101101101110001111110001100011101010010010001110110001011011010011101001100011101010110110111010111011101100101111111110011111010111011001000010 e9e68eb6e3f18ea48ec5b4e98eadbaeecbfe7d76e9e68eb6e3f18ea48ec5b4e98eadbaeecbfe7d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
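
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation (which this page does not describe). The codec names are Python's own and are assumed to correspond to the table's labels: "cp932" for SJIS-WIN and "cp949" for UHC. The helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Table labels mapped to Python codec names.
# Assumption: these codecs match the tool's charsets; edge cases may differ.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, concatenated without separators.
    binary = "".join(format(b, "08b") for b in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("迂}vB").items():
        print(label, binary, hexadecimal)
```

ASCII characters such as "}", "v" and "B" encode to the same bytes (7d, 76, 42) in all five charsets, which is why every row of the table ends in 7d7642.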