To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 與??餓??蘊??}v與??餓??蘊??}vB 1110010001101111001111110011111110001001111011000011111100111111111001010101110100111111001111110111110101110110111001000110111100111111001111111000100111101100001111110011111111100101010111010011111100111111011111010111011001000010 e46f3f3f89ec3f3fe55d3f3f7d76e46f3f3f89ec3f3fe55d3f3f7d7642
EUC-JP 與??餓??蘊??}v與??餓??蘊??}vB 1110011111010000001111110011111110110010111011100011111100111111111010011011111000111111001111110111110101110110111001111101000000111111001111111011001011101110001111110011111111101001101111100011111100111111011111010111011001000010 e7d03f3fb2ee3f3fe9be3f3f7d76e7d03f3fb2ee3f3fe9be3f3f7d7642
UTF-8 與썽콌餓뽩렘蘊딁윮}v與썽콌餓뽩렘蘊딁윮}vB 1110100010001000100001111110110010001101101111011110110010111101100011001110100110100100100100111110101110111101101010011110101110100000100110001110100010011000100010101110101110010100100000011110110010011100101011100111110101110110111010001000100010000111111011001000110110111101111011001011110110001100111010011010010010010011111010111011110110101001111010111010000010011000111010001001100010001010111010111001010010000001111011001001110010101110011111010111011001000010 e88887ec8dbdecbd8ce9a493ebbda9eba098e8988aeb9481ec9cae7d76e88887ec8dbdecbd8ce9a493ebbda9eba098e8988aeb9481ec9cae7d7642
UHC 與썽콌餓뽩렘蘊딁윮}v與썽콌餓뽩렘蘊딁윮}vB 1110011010101000101111011110100110110001100010001110010010111011100101101110010110110111101111011110100010110011100010101110011110011111101011010111110101110110111001101010100010111101111010011011000110001000111001001011101110010110111001011011011110111101111010001011001110001010111001111001111110101101011111010111011001000010 e6a8bde9b188e4bb96e5b7bde8b38ae79fad7d76e6a8bde9b188e4bb96e5b7bde8b38ae79fad7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)