To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 源??蛟???枯n}源??蛟???枯n{^ 100011001011100100111111001111111110010110000000001111110011111100111111100011001100110101101110011111011000110010111001001111110011111111100101100000000011111100111111001111111000110011001101011011100111101101011110 8cb93f3fe5803f3f3f8ccd6e7d8cb93f3fe5803f3f3f8ccd6e7b5e
EUC-JP 源??蛟??雩枯n}源??蛟??雩枯n{^ 10111000101110110011111100111111111010011110000000111111001111111000111111100110111110101011100011001111011011100111110110111000101110110011111100111111111010011110000000111111001111111000111111100110111110101011100011001111011011100111101101011110 b8bb3f3fe9e03f3f8fe6fab8cf6e7db8bb3f3fe9e03f3f8fe6fab8cf6e7b5e
UTF-8 源재렫蛟렰렒雩枯n}源재렫蛟렰렒雩枯n{^ 1110011010111010100100001110110010011110101011001110101110100000101010111110100010011011100111111110101110100000101100001110101110100000100100101110100110011011101010011110011010011110101011110110111001111101111001101011101010010000111011001001111010101100111010111010000010101011111010001001101110011111111010111010000010110000111010111010000010010010111010011001101110101001111001101001111010101111011011100111101101011110 e6ba90ec9eaceba0abe89b9feba0b0eba092e99ba9e69eaf6e7de6ba90ec9eaceba0abe89b9feba0b0eba092e99ba9e69eaf6e7b5e
UHC 源재렫蛟렰렒雩枯n}源재렫蛟렰렒雩枯n{^ 11101010101110011100000011100111100011101011100111001110111100011000111010111101100011101010011111101001111011001100110110111101011011100111110111101010101110011100000011100111100011101011100111001110111100011000111010111101100011101010011111101001111011001100110110111101011011100111101101011110 eab9c0e78eb9cef18ebd8ea7e9eccdbd6e7deab9c0e78eb9cef18ebd8ea7e9eccdbd6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)