To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??肄??幼??押る?悠??循??? 1110101001000000001111110011111111100011111001010011111100111111100101110110001100111111001111111000100110011111100000101110100100111111100101110100100100111111001111111000111101111010001111110011111100111111 ea403f3fe3e53f3f97633f3f899f82e93f97493f3f8f7a3f3f3f
EUC-JP 鵝??肄??幼??押る?悠??循??繇 11110011101000010011111100111111111001101110011100111111001111111100110111000100001111110011111110110010101000011010010011101011001111111100110110101010001111110011111110111101110110110011111100111111100011111101010011010001 f3a13f3fe6e73f3fcdc43f3fb2a1a4eb3fcdaa3f3fbddb3f3f8fd4d1
UTF-8 鵝숈뮆肄덃끽幼먯춪押る굞悠밭솾循뗭챾繇 111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000011111010111000000110111101111001011011100110111100111010111010100010101111111011001011011010101010111001101000101010111100111000111000001010001011111010101011010110011110111001101000001010100000111010111011000010101101111011001000011010111110111001011011111010101010111010111001011110101101111011001011000110111110111001111011100110000111 e9b59dec8888ebae86e88284eb8d83eb81bde5b9bceba8afecb6aae68abce3828beab59ee682a0ebb0adec86bee5beaaeb97adecb1bee7b987
UHC 鵝숈뮆肄덃끽幼먯춪押る굞悠밭솾循뗭챾繇 1110010010111101100110011110110010010010100101011110110010111101100010001110011010110011101000111110101011101010100100001110110010101101100001111110010011100011101010101110101110000010100001101110101011101101101110011110011110011001101100101110001011100000100010111110110010101010100010111110100110100011 e4bd99ec9295ecbd88e6b3a3eaea90ecad87e4e3aaeb8286eaedb9e799b2e2e08becaa8be9a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)