To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 帳ο?將??訝?}v帳ο?將??訝?}vB 1001001010100000100000111100110100111111100110111001001000111111001111111110011001100010001111110111110101110110100100101010000010000011110011010011111110011011100100100011111100111111111001100110001000111111011111010111011001000010 92a083cd3f9b923f3fe6623f7d7692a083cd3f9b923f3fe6623f7d7642
EUC-JP 帳ο?將??訝?}v帳ο?將??訝?}vB 1100010010100010101001101100111100111111110101011111001000111111001111111110101111000011001111110111110101110110110001001010001010100110110011110011111111010101111100100011111100111111111010111100001100111111011111010111011001000010 c4a2a6cf3fd5f23f3febc33f7d76c4a2a6cf3fd5f23f3febc33f7d7642
UTF-8 帳ο풁將됵슬訝긃}v帳ο풁將됵슬訝긃}vB 111001011011100010110011110011101011111111101101100100101000000111100101101100001000011111101011100100001011010111101100100010101010110011101000101010001001110111101010101110001000001101111101011101101110010110111000101100111100111010111111111011011001001010000001111001011011000010000111111010111001000010110101111011001000101010101100111010001010100010011101111010101011100010000011011111010111011001000010 e5b8b3cebfed9281e5b087eb90b5ec8aace8a89deab8837d76e5b8b3cebfed9281e5b087eb90b5ec8aace8a89deab8837d7642
UHC 帳ο풁將됵슬訝긃}v帳ο풁將됵슬訝긃}vB 11101101111000111010010111101111101111101000101011101101111000101000100111101111101111011011110111100100101110001000001101000110011111010111011011101101111000111010010111101111101111101000101011101101111000101000100111101111101111011011110111100100101110001000001101000110011111010111011001000010 ede3a5efbe8aede289efbdbde4b883467d76ede3a5efbe8aede289efbdbde4b883467d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)