What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 娃?????澳??}v娃?????澳??}vB 100010001010000100111111001111110011111100111111001111111110000001010011001111110011111101111101011101101000100010100001001111110011111100111111001111110011111111100000010100110011111100111111011111010111011001000010 88a13f3f3f3f3fe0533f3f7d7688a13f3f3f3f3fe0533f3f7d7642
EUC-JP 娃?????澳??}v娃?????澳??}vB 101100001010001100111111001111110011111100111111001111111101111110110100001111110011111101111101011101101011000010100011001111110011111100111111001111110011111111011111101101000011111100111111011111010111011001000010 b0a33f3f3f3f3fdfb43f3f7d76b0a33f3f3f3f3fdfb43f3f7d7642
UTF-8 娃띰쉠樂쒙슁澳뽳숱}v娃띰쉠樂쒙슁澳뽳숱}vB 1110010110101000100000111110101110011101101100001110110010001001101000001110111110100110101111111110110010010010100110011110110010001010100000011110011010111110101100111110101110111101101100111110110010001000101100010111110101110110111001011010100010000011111010111001110110110000111011001000100110100000111011111010011010111111111011001001001010011001111011001000101010000001111001101011111010110011111010111011110110110011111011001000100010110001011111010111011001000010 e5a883eb9db0ec89a0efa6bfec9299ec8a81e6beb3ebbdb3ec88b17d76e5a883eb9db0ec89a0efa6bfec9299ec8a81e6beb3ebbdb3ec88b17d7642
UHC 娃띰쉠樂쒙슁澳뽳숱}v娃띰쉠樂쒙슁澳뽳숱}vB 1110100011011111101101101110111110111101101010101110100011111001100111001110111110111101101100111110011111111110100101101110111110111101101000100111110101110110111010001101111110110110111011111011110110101010111010001111100110011100111011111011110110110011111001111111111010010110111011111011110110100010011111010111011001000010 e8dfb6efbdaae8f99cefbdb3e7fe96efbda27d76e8dfb6efbdaae8f99cefbdb3e7fe96efbda27d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
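
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the `CODECS` mapping and the `encode_row` helper are hypothetical names, and the Python codec chosen for each charset label (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption about the closest equivalent. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(CODECS[charset], errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "娃}vB"  # example input; any short string works
    for name in CODECS:
        binary, hexadecimal = encode_row(sample, name)
        print(name, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.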