To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????æ}v???????æ}vB 001111110011111100111111001111110011111100111111001111111110011001111101011101100011111100111111001111110011111100111111001111110011111111100110011111010111011001000010 3f3f3f3f3f3f3fe67d763f3f3f3f3f3f3fe67d7642
SJIS-WIN ??バ?∨?純?}v??バ?∨?純?}vB 001111110011111110000011011011110011111110000001110010010011111110001111100000110011111101111101011101100011111100111111100000110110111100111111100000011100100100111111100011111000001100111111011111010111011001000010 3f3f836f3f81c93f8f833f7d763f3f836f3f81c93f8f833f7d7642
EUC-JP ??バ?∨?純æ}v??バ?∨?純æ}vB 00111111001111111010010111010000001111111010001011001011001111111011110111100011100011111010100111000001011111010111011000111111001111111010010111010000001111111010001011001011001111111011110111100011100011111010100111000001011111010111011001000010 3f3fa5d03fa2cb3fbde38fa9c17d763f3fa5d03fa2cb3fbde38fa9c17d7642
UTF-8 룵혧バ룶∨룵純æ}v룵혧バ룶∨룵純æ}vB 111010111010001110110101111011011001100010100111111000111000001110010000111010111010001110110110111000101000100010101000111010111010001110110101111001111011010010010100110000111010011001111101011101101110101110100011101101011110110110011000101001111110001110000011100100001110101110100011101101101110001010001000101010001110101110100011101101011110011110110100100101001100001110100110011111010111011001000010 eba3b5ed98a7e38390eba3b6e288a8eba3b5e7b494c3a67d76eba3b5ed98a7e38390eba3b6e288a8eba3b5e7b494c3a67d7642
UHC 룵혧バ룶∨룵純æ}v룵혧バ룶∨룵純æ}vB 10001111101010101100001010001111101010111101000010001111101010111010000111111101100011111010101011100010111011011010100110100001011111010111011010001111101010101100001010001111101010111101000010001111101010111010000111111101100011111010101011100010111011011010100110100001011111010111011001000010 8faac28fabd08faba1fd8faae2eda9a17d768faac28fabd08faba1fd8faae2eda9a17d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)